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Bootstrap percolation on the random graph Gn,p is a process of 
spread of "activation" on a given reahzation of the graph with a given 
number of initially active nodes. At each step those vertices which 
have not been active but have at least r > 2 active neighbors become 
active as well. 

We study the size A* of the final active set. The parameters of the 
model are, besides r (fixed) and n (tending to cxd), the size a — a(n) 
of the initially active set and the probability p — p{n) of the edges 
in the graph. We show that the model exhibits a sharp phase tran- 
sition: depending on the parameters of the model, the final size of 
activation with a high probability is either n — o{n) or it is o(n). We 
provide a complete description of the phase diagram on the space of 
the parameters of the model. In particular, we find the phase tran- 
sition and compute the asymptotics (in probability) for A"; we also 
prove a central limit theorem for A* in some ranges. Furthermore, 
we provide the asymptotics for the number of steps until the process 
stops. 



1. Introduction. Bootstrap percolation on a graph G is defined as the 
spread of activation or infection according to the following rule, with a given 
threshold r > 2: We start with a set .4(0) C V{G) of active vertices. Each 
inactive vertex that has at least r active neighbors becomes active. This 
is repeated until no more vertices become active, that is, when no inactive 
vertex has r or more active neighbors. Active vertices never become inactive, 
so the set of active vertices grows monotonously. 
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To avoid confusion, we will use the terminology that each active vertex 
infects all its neighbors, so that a vertex that is infected (at least) r times 
becomes active. 

We are mainly interested in the final size A* of the active set, and in 
particular whether eventually all vertices will be active or not. If they are, we 
say that the initial set ^(0) percolates (completely). We will study a sequence 
of graphs of order n — )■ oo; we then also say that (a sequence of) ^(0) almost 
percolates if the number of vertices that remain inactive is o(n), that is, if 
A* = n — o{n). 

Bootstrap percolation on a lattice (which is a special example of a cellu- 
lar automata) was introduced in 1979 by Chalupa, Leath and Reich [24] as 
a simplified model of some magnetic systems. Since then bootstrap perco- 
lation has been studied on various graphs, both deterministic and random. 
One can study either a random initial set or the deterministic problem of 
choosing an initial set that is optimal in some sense. A simple example of 
the latter is the classical folklore problem to find the minimal percolating set 
in a two-dimensional grid (i.e., a finite square [n]^ in the square lattice); see 
Balogh and Pete [13] and Bollobas [18]. (These references also treat higher- 
dimensional grids [n]"'.) Another extremal problem is studied by Morris [39]. 
The problem with a random initial set was introduced by Chalupa, Leath 
and Reich [24] (lattices and regular infinite tree), and further studied on 
lattices by Schonmann [42]; it has, in particular, been studied on finite grids 
(in two dimensions or more), see Aizenman and Lebowitz [1], Balogh and 
Pete [13], Cerf and Cirillo [20], Cerf and Manzo [21], Holroyd [29], Balogh, 
Bollobas and Morris [9], Gravner, Holroyd and Morris [27]. In a recent paper, 
Balogh et al. [6] derived a sharp asymptotic for the critical density (i.e., the 
critical size of a random initial set) for bootstrap percolation on grids of any 
dimension, generalizing results of Balogh, Bollobas, and Morris [8]. Grids 
with a different edge set where studied by Holroyd, Liggett and Romik [30]. 
The study of bootstrap percolation on lattices is partly explained by its ori- 
gin in statistical physics, and the bootstrap process is being successfully used 
in studies of the Ising model; see [22, 23, 26, 40]. Lately bootstrap percola- 
tion has also been studied on varieties of graphs different from lattices and 
grids; see, for example, Balogh and Bollobas [5] (hypercube); Balogh, Peres 
and Pete [12] (infinite trees); Balogh and Pittel [14], Janson [32] (random 
regular graphs); an extension where the threshold may vary between the 
vertices is studied by Amini [2]. An anisotropic bootstrap percolation was 
studied by Duminil-Copin and van Enter [25]. Further, a graph bootstrap 
percolation model introduced by Bollobas [17] already in 1968, where edges 
are infected instead of vertices, was analyzed recently by Balogh, Bollobas 
and Morris [10] and Balogh et al. [11]. 

In the present paper, we study bootstrap percolation on the Erdos-Renyi 
random graph Gn,p with an initial set ^(0) consisting of a given number a 
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of vertices chosen at random. (By symmetry, we obtain the same results for 
any deterministic set of a vertices.) Recall that Gn,p is the random graph on 
the set of vertices Vn = {I, ■ ■ ■ ,n} where all possible edges between pairs of 
different vertices are present independently and with the same probability p. 
As usual, we let p = p{n) depend on n. 

A problem equivalent to bootstrap percolation on G„,p in the case p = \/n 
was studied by Scalia-Tomba [41], although he used a different formulation 
as an epidemic. (Ball and Britton [3, 4] study a more general model with 
different degrees of severity of infection.) Otherwise, bootstrap percolation 
on Gn,p was first studied by Vallier [47]; we here use a simple method (the 
same as [41]) that allows us to both simplify the proofs and improve the 
results. We will state the results for a general fixed r > 2 (the case r = 1 is 
much different; see Remark 5.9); the reader may for simplicity consider the 
case r = 2 only, since there are no essential differences for higher r. 

We will see that there is a threshold phenomenon: typically, either the final 
size A* is small (at most twice the initial size a), or it is large [sometimes 
exactly n, but if p is so small that there are vertices of degree less than r, 
these can never become active except initially so eventually at most n — 
o(n) will become infected]. We can study the threshold in two ways: in the 
first version, we keep n and p fixed and vary a. In the second version, we 
fix n and a and vary p. We will state some results for both versions and 
some for the former version only; these too can easily be translated to the 
second version. We will also study dynamical versions, where we add new 
external infections or activations or new edges until we reach the threshold; 
see Section 4. 

Apart from the final size A*, we will also study the time r the bootstrap 
process takes until completion. We count the time in generations: genera- 
tion is ^(0), generation 1 is the set of other vertices that have at least r 
neighbors in generation 0, and so on. The process stops as soon as there is an 
empty generation, and we let r be the number of (nonempty) generations. 
Thus, if we let be the set of vertices activated in generation /c, then 

(1.1) r := max{A; > : / 0} = min{A; > 1 : = 0} - 1. 

Remark 1.1. Bootstrap percolation does not seem to be a good model 
for usual infectious diseases; see, however. Ball and Britton [3]. It might be 
a better model for the spread of rumors or other ideas or beliefs; cf. the 
well-known rule, "What I tell you three times is true" in Carroll [19]. 

Bootstrap percolation can be also viewed as a simplified model for propa- 
gation of activity in a neural network. Although related neuronal models are 
too involved for a rigorous analysis (see, e.g., [36, 44, 46]) they inspired study 
of bootstrap percolation on Gn,p by Vallier [47]. There is a further discus- 
sion on the application of bootstrap percolation on Gn.p to neuromodelling 
in [45]. 
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Remark 1.2. Instead of Gn,p, one might consider the random graph 
G{n, m) , with a given number m = m[n) of edges. It is easy to obtain a result 
for G{n,m) from our results for Gn,p, using monotonicity, but we usually 
leave this to the reader. [In the dynamical model in Section 4.3, we consider 
G{n,m), however.] 

Remark 1.3. An alternative to starting with an initial active set of 
fixed size a is to let each vertex be initially activated with probability a = 
a{n) > 0, with different vertices activated independently. Note that this is 
the same as taking the initial size a random with a £ Bin(n,a). For most 
results the resulting random variation in a in negligible, and we obtain the 
same results as for a = na, but for the Gaussian limit in Theorems 3.6(iii) 
and 4.5, the asymptotic variances are changed by constant factors. We leave 
the details to the reader. 

Some open problems arise from our study. In [9], Balogh, Bollobas and 
Morris determine the critical probability for bootstrap percolation on grids 
when the dimension d = d{n) oo. A similar idea translated to the G{n,p) 
graph would be to study what happens when r = r{n) — )• oo. This problem 
is not treated here although our methods might be useful also for such prob- 
lems. The problem of majority percolation where a vertex becomes activated 
if at least half of its neighbors are active \r{v) = d{v)/2] has been studied 
on the hypercube by Balogh, Bollobas and Morris [7]. On the d-dimensional 
grid d{v) /2 = d but on the G{n,p) graph, this problem is completely different 
and still open. (We thank the referee for these suggestions.) 

The method is described in Section 2. The main results are stated in 
Section 3, with further results in Sections 4 and 5. Proofs are given in Sec- 
tions 6-11. 

1.1. Notation. All unspecified limits are as n — )• oo. We use — ^ for con- 
vergence in distribution and — ^ for convergence in probability of random 
variables; we further use Op and Op in the standard sense (see, e.g., [33] 
and [34]), and we use w.h.p. (with high probability) for events with prob- 
ability tending to 1 as n— )-oo. Note that, for example, "= o(l) w.h.p." is 
equivalent to "= Op(l)" and to "— 0," and that "~ a„ w.h.p." is equiva- 
lent to "= (1 + Op{l))an'" see [33]. A statement of the type "when V, then 
w.h.p. Q" (or similar wording), where V and Q are two events, means that 
¥{V and (not Q)) 0, that is, that w.h.p. "(not V) or Q" holds. (See, e.g.. 
Theorem 3.2 and Proposition 10.10.) If ¥{V) is bounded away from 0, this 
is equivalent to "conditioned on V, Q holds w.h.p." 

If Xn is a sequence of random variables, and /i„ and o"^ are sequences of 

real numbers, with cr^ > 0, we say that Xn £ AsN(/i„, o"^) if (X„ — /i„) / fT„ 
iV(0,l). 
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Occasionally we use the subsubsequence principle ([34], page 12), which 
says that to prove a limit result (e.g., for real numbers, or for random vari- 
ables in probability or in distribution), it is sufficient to show that every 
subsequence has a subsubsequence where the result holds. We may thus, 
without loss of generality, select convenient subsequences in a proof, for 
example, such that another given sequence either converges or tends to oo. 

2. A useful reformulation. In order to analyze the bootstrap percolation 
process on Gn,p, we change the time scale; we forget the generations and 
consider at each time step the infections from one vertex only. Choose ui G 
^(0) and give each of its neighbors a mark; we then say that ui is used, and 
let Z{1) := {ui} be the set of used vertices at time 1. We continue recursively: 
at time t, choose a vertex ut G A{t — 1) \ 2[t — 1). We give each neighbor 
of Ut a new mark. Let AA{t) be the set of inactive vertices with r marks; 
these now become active, and we let A(t) = A{t — 1) U AA{t) be the set of 
active vertices at time t. We finally set Z{t) = Z(t — 1) U {ut\ = {us - s <t}, 
the set of used vertices. [We start with Z{0) = 0, and note that necessarily 
AA{t) = for t < r.] 

The process stops when A{t) \ Z{t) = 0, that is, when all active vertices 
are used. We denote this time by T, 

(2.1) T:=mm{t>O:A{t)\Z{t) = 0}. 

Clearly, T < n; in particular, T is finite. The final active set is A{T); it is clear 
that this is the same set as the one produced by the bootstrap percolation 
process defined in the Introduction; only the time development differs. Hence 
we may as well study the version just described. [This is true for any choice 
of the vertices ut- For definiteness, we may assume that we keep the unused, 
active vertices in a queue and choose ut as the first vertex in the queue, and 
that the vertices in AA{t) are added at the end of the queue in order of 
their labels. Thus ut will always be one of the oldest unused, active vertices, 
which will enable us to recover the generations; see further Section 10. In 
Section 4, we consider other ways of choosing ut-] This reformulation was 
used already by Scalia-Tomba [41] (for a more general model). It is related 
to the (continuous-time) construction by Sellke [43] for an epidemic process. 

Let A{t) := \A{t)\, the number of active vertices at time t. Since \Z(t)\ = t 
and Z(t) C A{t) for t = 0, . . . , T, we also have 

(2.2) T = mm{t > : A{t) =t}= mm{t > : A{t) < t}. 
Moreover, since the final active set is A(T) = Z(T), its size A* is 

(2.3) A* := A{T) = |^(T)| = \Z{T)\ = T. 

Hence, the set .4(0) percolates if and only if T = n, and ^(0) almost perco- 
lates if and only if T = n — o(n). 
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We analyze this process by the standard method of reveahng the edges 
of the graph Gn,p only on a need-to-know basis. We thus begin by choos- 
ing ui as above and then reveal its neighbors; we then find U2 and reveal its 
neighbors, and so on. Let, for i ^ Z{s), Ii{s) be the indicator that there is 
an edge between the vertices Ug and i. This is also the indicator that i gets 
a mark at time s, so if Mi{t) is the number of marks i has at time t, then 

t 

(2.4) Mi{t) = Y,h{s), 

s=l 

at least until i is activated (and what happens later does not matter). Note 
that if i ^ ^(0), then, for every t<T,ie A{t) if and only if Mi{t) > r. The 
crucial feature of this description of the process, which makes the analysis 
simple, is that the random variables Ii{s) are i.i.d. Be(p). 

We have defined Ii{s) only for s <T and i ^ Z{s), but it is convenient to 
add further (redundant) variables so that Ii{s) are defined, and i.i.d., for all 
i £Vn and all s > 1. One way to do this formally is to reverse the procedure 
above. We start with i.i.d. Ii{s) G Be(p), for i G and s > 1, and a set 
-4.(0) C Vn- We let Z{0) := and start with an empty graph on Vn- We then, 
as above, for t = 1, . . . , n select ut G A{t— l)\Z{t — 1) if this set is nonempty; 
otherwise we select ut G Vn\Z{t— 1) (taking, e.g., the smallest such vertex). 
We define Mi{t) by (2.4) for ah i G K and t > 0, and update A{t) := A{0) U 
{i : Mi{t) > r} and Z{t) := Z{t - 1) U {uj = {u^ : s < t}. Furthermore, add 
an edge uti to the graph for each vertex i ^ Z{t) such that Ii{t) = 1. Finally, 
define T by (2.1) or (2.2). 

It is easy to see that this constructs a random graph Gn,p and that A{t), 
t <T, is as above for this graph, so the final active set of the bootstrap 
percolation on the graph is A{T). 

Define also, for i £Vn\ A{0), 

(2.5) Yi:=mm{t:Mi{t)>r}. 

If Yi <T, then Yi is the time vertex i becomes active, but if 1^ > T, then i 
never becomes active. Thus, for t <T, 

(2.6) A{t) = A{0)U{i^A{0):Yi<t}. 

By (2.4), each Mi{t) has a binomial distribution Bin(t,p). Further, by (2.4) 
and (2.5), each Yi has a negative binomial distribution NegBin(r,p), 

(2.7) P(y, = k) = F{M,{k - 1) = r - 1, /,(A:) = 1) = ^ 2 J ) /(I - p)^'"^; 

moreover, these random variables Yi are i.i.d. 
We let, for t = 0,1, 2,..., 

(2.8) Sit):=\{i^A{0):Y,<t}\= l{Yi<t}, 
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so, by (2.6), and our notation A{0) = a, 

(2.9) A{t)=A{0) + S{t) = S{t) + a. 

By (2.9), (2.2) and (2.3), it suffices to study tlie stochastic process S(t). 
Note tliat S{t) is a sum of n — a i.i.d. processes l{t > 1^}, eacli of wliicli 
is 0/1- valued and jumps from to 1 at time Yi, where Yi has the dis- 
tribution NegBin(r,p) in (2.7). We write S{t) = Sn-ait) when we want 
to emphasize the number of summands in S{t); more generally we de- 
fine Sm{t) ■= YlT^i — foi' <n [assuming for consistency that 
A{0) = {n-a + l,...,n}]. 

The fact that S{t), and thus A{t), is a sum of i.i.d. processes makes the 
analysis easy; in particular, for any given t, 

(2.10) Sit) eBm{n- a,TT{t)), 
where 

(2.11) 7r(t) := ¥{Yi <t)= P(Mi(t) > r) = P(Bin(t,p) > r). 
In particular, we have 

(2.12) ES{t) = {n-a)7r{t), 

(2.13) Var S{t) = {n - a)7r(t)(l - 7r(t)) < E S{t) < m:{t). 

To avoid rounding to integers sometimes below, we define S{t) := <S'([tJ) 
and 7r(t) := vr([tj) for all real t > 0. We also sometimes (when it is obviously 
harmless) ignore rounding to simplify notation. 



3. Main results. 



3.1. Limits in probability. For given r, n, and p we define, for reasons 
that will be seen later. 



(3.1) t, :-- 



np' 



(3.2) a,:={l--]t„ 



(3.3) 6.:=n(^")^' 



r-l 



(r- 1)! 

In particular, for r = 2, t^ := l/(np^) and := l/(2np^). For future use, 
note also that (3.1) can be written 

(3.4) JJ^f^k, 

r\ r 
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Our standard assumptions p <^ n~^^^ and p ^ imply that 

(3.5) 

ttc — )• oo, Oc/n— )-0, bc/n^O, pbc^O, 

and further 

(3.6) E5„(ic) = n7r(tc)~n-^^ = ^, 

n — ESnin) = n(l — vr(n)) = nP(Bm(n,p) < r — 1) 

(3.7) ~nP(Bin(n,p) = r-l) 

If p < then (1 - p)" ~ e-"P and (3.7) yields 

(3.8) n-ESnin)=n{l-Tr{n))r^b'^r^bc; 

if p is larger [p= 0(n~^/^), i.e., n~^^'^ = 0{p)], this is not quite true, but in 
this case both b'^ and 6c decrease to very fast; in all cases 

(3.9) n - E5„(n) = n(l - 7r(n)) = 6c + o(6c + 1). 

Recall that our main interest is in S{t) = Sn-a{t) rather than Snit); 
see (2.10); for S{t) we obviously have similar results, with additional er- 
ror terms depending on o; see (2.12) and, for example, (8.2). 

Note further that by (3.3), for any /3 G (—00,00), 

r -00, j-oo, 

nj5- (logn + (r- l)log(nji)) < /?, <S=^ 6c < ('^ - l)'~^e"'^, 

loo, lo, 

which by simple calculations yields, provided p'>n~^, 

( -00, 

np — (logn + (r — 1) log log n) — )• < /3, 

I 00, 

(3.10) 

foo, 

^ 6c ^ <^ {r-l)\-^e-^, 

lo. 

Our first result, to be refined later, shows that the threshold for almost 
percolation is a = a^. The proof of the theorems in this section are given 
later (Sections 8-10). Let us recall that A* is the final size of the active set, 
and that A* = T = A{T) = a + Sn-a{T). 

Theorem 3.1. Suppose thatr>2 and <^p<^n~^^^ . 
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(i) // a/oc — a < 1, then A* = {^p{a) + Op(l))tc; where (p{a) is the 
unique root in [0, 1] of 

(3.11) rip{a) — ip{aY = {r — l)a. 

[For r = 2, ip{a) = 1 - VT^.] 

Further, A* /a 971(a) := p^9j(a)/a, with (/?i(0) := 1. 

(ii) If a/ac>l + 6 , for some 5 > 0, then A* = n — 0p{n); in other words, 
we have w.h.p. almost percolation. More precisely. A* = n — Op(6c)- 

(iii) In case (ii), if further a < n/2, say, we further have complete per- 
colation, that is, A* = n w.h.p., if and only if — )• 0, that is, if and only if 
up — (logn + (r — 1) log log n) — )• 00. 

It is easily verified that ipi is a continuous, strictly increasing function 
[0, 1] — )• [l,r/(r — 1)]. In particular, in the subcritical case (i), we thus have 
w.h.p. A* < {r/{r — l))a < 2a, so the activation will not spread to many 
more than the originally active nodes. 

In the supercritical case (ii), we have the following more detailed result. 

Theorem 3.2. Suppose thatr>2, <^p <^n~^/''' anda = o{n), and 
that A* =n — 0p{n) as, for example, in Theorem 3.1(ii). Then: 

(i) // np — (logn + (r — 1) loglogJi) — )■ — 00, so be ^00 by (3.10), then 
A* = n — 6c(l + Op(l)). In particular, w.h.p. we do not have complete perco- 
lation. 

(ii) If up— (logn+ (r — 1) loglogn) — t- 00, so 6c by (3.10), then w.h.p. 
A* = n, so we have complete percolation. 

(iii) Ifnp— (log7T, + (r — 1) loglogn) — ;> /3g(— 00, 00), so be ^ b> by (3.10), 

then n — A* Po(6); in particular, F{A* = n) — t- exp(— 6) € (0, 1). 

More generally, even if we do not have almost percolation w.h.p., the result 
holds w.h.p. provided A* > 3tc- 

By the last statement we mean that P(the result fails and A* > 3tc) — ^ 0. 
In particular, it holds w.h.p. conditioned on A* > St^, provided we have 
liminfP(A* > 3tc) > 0. 

Remark 3.3. Let B be the set of vertices in Gn,p with degrees less 
than r. These are never activated unless they happen to be in the initially 
active set ^(0), and for each of the vertices, this has probability a/n— )■ if 
a = o(n); hence trivially ^* < n- |e|(l -Op(l)). We have [cf. (3.7) and (3.9)] 

E \B\ = nP(Bin(n - l,p) < r - 1) ~ 6c + o(6c + 1) 

with concentration of around its mean if 6c —t- 00 and a limiting Poisson 
distribution if 6c — )• 6 < 00; see [34], Sections 6.2 and 6.3, and [15]. Comparing 
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this with Theorem 3.2 we see that in the supercritical case, and with a = 
o(n), the final inactive set Vn \A{T) differs from B by Op(|;S|) vertices only, 
and in the case be = 0(1) [combining cases (ii) and (iii) in Theorem 3.2], 
w.h.p. Vn \ •^(T) = B. In other words, when we get a large active set, the 
vertices that remain inactive are mainly the ones with degrees less than r, 
and if further = 0(1), they are w.h.p. exactly the vertices with degrees 
less than r. 



We can, as discussed earlier, also consider thresholds for p for a given a. 

Theorem 3.4. Suppose that r > 2 and that a — >■ cx) with a = o{n). Then 
the threshold for p for almost percolation is 

(3.12) p.:^( '-^>;:r'" )'^„.-)-^ 

in the sense that if, for some 6 > 0, p< {l — 6)pc, then A* <2a = o{n) w.h.p., 
while if p> (1 + 5)pc, then A* = n — o{n) w.h.p. In the latter case, further 
A* =n w.h.p. if and only if p= (logn + (r — 1) log log n + uj{n)) /n for some 
uj{n) — )• oo. 

Note that n~~^ <^Pc^ n~^/'^ . Equation (3.12) is the inverse to (3.2) in the 
sense that the functions p^ and p i— )• Oc that they define are the inverses 
of each other. For r = 2, (3.12) simplifies to pc = (2na)~^/^. 

Remark 3.5. Note that the thresholds for complete and almost perco- 
lation are different only for large a. Indeed, for such a case the threshold pc 
for almost percolation can be so small that the graph G{n,pc) may not 
be even connected. Then, besides pc, we have the second threshold for the 
complete percolation; for example, if a = n/logn and r = 2, there are two 
thresholds: pc = @{\^logn/n) for almost percolation, and 0(log(n)/n) for 
complete percolation. If a is small enough so that G{n,pc) is dense enough 
(e.g., if a < 0.49n/log^n when r = 2), these two thresholds coincide. 

3.2. Gaussian limits. To study the threshold at Oc more precisely, we 
approximate 7r(t) in (2.11) by the corresponding Poisson probability, 

(3.13) n{t) := P(Po(tp) > r) = 7/;(tp) := ^e'^'*. 

j=r ^' 



Note that ^/j is a differentiable, increasing function on (0,oo), and that 
(3.14) = p^'ipt) = = ^e-*. 
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By a standard estimate for Poisson approximation of a binomial distribution 
(see, e.g., [15], Theorem 2.M), 

(3.15) \Tr{t) - fr{t)\ < dTv(Bm{t,p),Po{tp)) < p, 

where (ixv denotes the total variation distance. A sharper estimate for 
small t will be given in Lemma 9.4. 
We define, for given n and p, 

/o^ft^ * . n%{t)-t 
3. lb) := — mm -— , 

^ ' " t<3tc 1 - ^{t) 

and let t* G [0, 3tc] be the point where the minimum is attained. Under our 
standard assumptions <C n"^/*", for t < 3tc, when pt — )• by (3.5), 

we have, by (3.13) and (3.4), 

(3.17) 7^7^(0-^^*^'-^*^'*^ 



r! V* 

and thus tt (t) — )• and 1 — 7f(t) ~ 1; it follows easily that o* ~ Oc and t* ~ tc- 
More precise estimates are given in Lemma 9.5, where it also is shown that 
is unique (for large n, at least). Furthermore, by Lemma 9.1 below, for 
large n, the minimum in (3.16) could as well be taken over t < n/2, say, 
since nTf{t) — t > for t G [3tc,n/2]. 

The following theorem shows that the precise threshold for a is a* it 
0{^/a^), with a width of the threshold of the order ^/a^^ V^- ^ denotes 
the standard normal distribution function. Note that Theorem 3.2 applies, 
provided a = o{n), and provides more detailed information on A* when A* 
is large [i.e., in (ii) and in (iii) conditioned on, say, A* > 3tc]. 

Theorem 3.6. Suppose that r>2 and <C p ^ n'^^/''' . 

(i) If{a — al)/^/a^ — — oo, then for every e > 0, w.h.p. A* <tl< tc{l + e) 
If further a/a* 1, then A* = (1 + Op(l))ic. 

(ii) If {a-a*)/y/a^^+oo, then A* = n - Op{bc) ■ 

(iii) // (a — a*^)/ y/aZ y & (—00,00), then for every e > and every 
b* > 6c with b* = o{n), 

¥{A* >n-h*)^ $((r - if'^y), 
G [(1 - e)tc, (1 + e)tS ^ 1 - <I>((r - if'^y). 

For the corresponding result when we keep a fixed and change p, we define, 
for given n and a, 

(3.18) 7(p) := inf {(n-a)fi-(t) -t}. 

t<n/2 

Since vr is an increasing function of p, ^{p) is increasing, with 7(0) = —n/2 
and, provided, for example, a = o(n), 7(1) = o(l) [attained at t = o(l)]. 
Given a = a{n) — )• 00 with a = o(n), there is thus (for large n) a unique p* 
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such that 

(3.19) 7(Pc*) = -«- 

We will see in Lemma 9.2 that p* ~Pc- It is easily verified that, for large n 
at least, a* = a 'y{p) = —a, and thus and a i— )■ p* are the inverses 

of each other. 

Theorem 3.7. Suppose r>2 and a — )• oo with a = o{n). 

(i) If {P/P*c - l)a^''^ -oo, then A* < ((r/(r - 1) + Op(l))a. If further 
p/p* 1, then A* = ((r/(r - 1) + Op(l))a. 

(ii) If {p/Pc — l)a^^^ — )■ +00, then A* = n — Op{n); if further np— (logn + 
(r — 1) log log n) — )• oo, then A* = n w.h.p. 

(iii) // {p/Pc — l)a^/^ — )• A G (—00,00), then for every e > 0, 

(3.20) F{A* > (1 - e)n) ^ ^>(r(r - ly^^^X), 



(3.21) pL4*G 



1 y ' V r- 1 



e \ a, I + e a 



l_$(r(r- 1)^1/2^). 



If further np — (logn + (r — 1) loglogn) — t- 00, then (3.20) can he replaced by 
¥{A* = n) $(r(r - 1)~^/^X). 

In the subcritical cases in Theorems 3.1(i) and 3.6(i), we also obtain 
a Gaussian limit for the size of the final active set. 

Theorem 3.8. Suppose r > 2 and <Cp ^ n~^^^ . Let be the small- 
est positive root of 

(3.22) {n-a)TT{t*) + a-t^ = 0. 

(i) // a/oc — )■ a G (0,1), then ~ ip{a)tc with ip{a) £ (0,1) given by 
(3.11), and A* e AsN{t^,ip2{a)tc), where ip2{a) := ip{aY {I - ipiaY''^)-'^ /r . 

(ii) // a/oc — )• 1 and also (a — a*^)/ y/a^ — )• —00, then t^ ~ tc, more pre- 
cisely 



2t 

(3.23) t. = t*-{l + oil))^l-^{a* 



and 



^*'''''K'- 2ir-mi-a/at) )- 



Remark 3.9. It follows from the proof that in both cases, for large n 
at least, t^ is the unique root of (3.22) in [0,t*]. In (i), also t^ < t^, so t^ 
is the unique root in [0,tc]- In (ii); this is not always true. By Lemma 9.5, 
still for large n, t* > tc and ~ ~ pi^/i"^ ~ !)• Ifi for example, r = 2 
and p = logn/n, then t* — tc^ nj log^ n, while a = a* — ^fn yields t\ — t^^ 
V2n^/'^/logn'^t*-tc. 
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3.3. The number of generations. In the supercritical case a — a* ^ 
when ^(0) w.h.p. almost percolates by Theorem 3.6, we have the follow- 
ing asymptotic formula for the number of generations until the bootstrap 
percolation process stops. 

Theorem 3.10. Suppose that r > 2, <^p <^ n"^/*" and a = o{n). 
Assume than a — a*^ yja^ [so that .4(0) w.h.p. almost percolates]. Then, 
w.h.p., 



(3.24) 



a 



+ i^ + Op(l). 

np 



This theorem is an immediate consequence of Propositions 10.1, 10.4, 10.7 
and 10.10 in Section 10. Moreover, these propositions show that the three 
terms [excepting the error term Op(l)] in the formula (3.24) are the numbers 
of generations required for three distinct phases of the evolution: the begin- 
ning including (possibly) a bottleneck when the size is about tc; a period 
of doubly exponential growth; and a final phase where the last vertices are 
activated. Note that each of the three terms may be the largest one. 

Example 3.11. Let p = with 1/r < a < 1, and suppose a = 0{a^). 
Then the third term in (3.24) is 0(1) and can be ignored while the second 
term is log log n/ log r + 0(1). If we are safely supercritical, say a = 2ac, 
then the first term too is 0(1) and the result is r ~ log log ?i/ log r w.h.p., 
dominated by the second term. 

If instead the process is only barely supercritical, with a = a* + Oc say, 
with 1/2 < /3 < 1, then the first term in (3.24) is Cn'^ with C > and the 
exponent 7 = • , which dominates the other terms. Note that the 
exponent here can be any positive number in (0, 1/4) (with 7~l/4 ifa~l 
and /? ~ 1/2, so the graph is very sparse and the initial set is minimal). 

Finally, if p = loglogn/n, say, so the graph is very sparse, and a = 2ac, 
then again the first term in (3.24) is 0(1), the second is O (log log log log n), 
while the third is log n/ log log n, which thus dominates the sum. 

Note that the second term is O(loglogn), and the third is o(logn) [and in 
many cases 0(1) so it can be ignored], while the first term may be as large 
as n^/^~°^^^ [although it too in many cases is 0(1)]. 



Remark 3.12. In the subcritical case, one could presumably obtain 
similar results for the number of generations until the process stops, but we 
have not pursued this topic here. 
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4. Dynamical models. We usually assume, as above, that a and p are 
given, but we can also consider dynamical models where one of them grows 
with time. 

4.1. Adding external activations. In the first dynamical model, we let n 
and p be given, and consider a realization of Gn,p- We start with all vertices 
inactive (and completely uninfected). We then activate the vertices (from 
the outside) one by one, in random order. After each external activation, 
the bootstrap percolation mechanism works as before, activating all vertices 
that have at least r active neighbors until no such vertices remain; this is 
done instantaneously (or very rapidly) so that this is completed before the 
next external activation. Let be the number of externally activated ver- 
tices the first time that the active set A is "big" in some sense. For example, 
for definiteness, we may define "big" as |^| > n/2. [It follows from Theo- 
rem 3.1 that any threshold |^| > cn for a constant cG (0,1) will give the 
same asymptotic results, as well as thresholds tending to or n sufficiently 
slowly. If np — (logn + (r — 1) log log n) — >• oo, we may also choose the con- 
dition = n, that is, complete percolation ^ = Then Aq is a random 
variable (depending both on the realization of Gn,p and on the order of ex- 
ternal activations). In this formulation, the threshold result in Theorem 3.1 
may be stated as follows. 

Theorem 4.1. Suppose that r >2 and <^ p <^ n~^/^. Then Aq/ 



Proof. The active set after a external activations is the same as the 
final active set A{T) in the static model considered in the rest of this paper 
with these vertices chosen to be active initially. Hence, for any given a, 
Aq < a if and only if bootstrap percolation with a initially active yields 
a big final active set. In particular, ii S > 0, then Theorem 3.1(i) implies that 
P(^o < (1 - S)ac) 0, while Theorem 3.1(ii) and (iii) imply that F(^o < 

More precisely. Theorem 3.6 yields a Gaussian limit. 

Theorem 4.2. Suppose that r >2 and n^^ <Cp <C n^^/''. Then Aq e 
AsN(a*,ac/(r-l)). 

Proof. Let x G (— cx),oo). Then, arguing as in the proof of Theorem 4.1 
but now using Theorem 3.6(iii) (with y = xj^Jr — 1), we find 





< X = P(^o < Oc + x^Ja^l(r-\)) $(x). 



□ 
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We have here for simphcity assumed that the external activations are done 
by sampUng without replacement, but otherwise independently of whether 
the vertices already are (internally) activated. A natural variation is to only 
activate vertices that are inactive. Let Aq be the number of externally acti- 
vated vertices when the active set becomes big in this version. Since a new 
activation of an already active vertex does not matter at all, A'q equals in the 
version above the number of externally active vertices among the first 
that are not already internally activated. Thus Aq — A'q is the number of 
external activations that hit an already active vertex. It is easily verified 
that this is Op(ac), and thus Theorem 4.1 holds for Aq as well; we omit the 
details. It seems likely that it is possible to derive a version of the Gaussian 
limit in Theorem 4.2 for Aq too, but that would require a more careful es- 
timate of Aq — Aq (and in particular its variance), which we have not done, 
so we leave this possibility as an open problem. 

Remark 4.3. One way to think about this dynamical model, where we 
add new active vertices successively and may think of these as being initially 
active, is to see it as a sequence of bootstrap percolation processes, one for 
each a = 0, 1, . . . , n; the processes live on the same graph Gn,p but have dif- 
ferent numbers of initially active vertices, and they are coupled in a natural 
way. In order to really have the same realization of Gn,p for different a, we 
have to be careful in the choice of the order in which we explore the vertex 
neighborhoods, that is, the choice oiut- [Recall that Gn,p is constructed from 
the indicators Ii{s) and the sequence (ut); see Section 2.] We can achieve 
this by first making a list L of all vertices in the (random) order in which 
they are externally activated. We then at each time t choose ut as an unused 
internally activated vertex (e.g., the most recent one) if there is any such 
vertex, and otherwise as the next unused vertex in the list L. 

This model makes it possible to pose now questions about the bootstrap 
percolation. For example, we may consider the critical process starting with 
exactly Aq initially active vertices (i.e., the first process that grows beyond 
the bottleneck and becomes big) and ask for the number of generations until 
the process dies out. Alternatively, we may consider the process starting with 
exactly Aq — 1 initially active vertices (i.e., the last process that does not 
become big) and ask for its final size. 

Such questions will not be treated in the present paper, but we mention 

that it is easily seen that the final size with Aq — 1 initially active vertices 

is tc(l + Op(l)) so that the final size jumps from about to about n with 

the addition of a single additional initial vertex. Furthermore, we conjecture 

that, under suitable conditions, the number of generations for the process 

1/3 

with ^0 initially active vertices is of order Oc (which is much larger than 
the number of generations for any fixed a; see Section 3.3). 
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4.2. Adding external infections. An alternative to external activations is 
external infections, where we again start with all vertices inactive and unin- 
fected, and infect vertices one by one from the outside, choosing the infected 
vertices at random (independently and with replacement); as before, r in- 
fections (external or internal) are needed for activation, and active vertices 
infect their neighbors. Let Jq be the number of external infections when the 
active set first becomes "big" (as in Section 4.1). (Thus, Jq is a random 
variable.) 

In the original model, each initially active vertex infects about np other 
vertices so the total number of initial infections is about npa; it is thus easy to 
guess that Jq ~ npuc - Indeed, this is the case as is shown by the next theorem. 
We cannot (as far as we know) directly derive this from our previous results, 
since the dependencies between infections in the two versions are slightly 
different, but it follows by a minor variation of our method; see Section 8. 
We believe that the result could be sharpened to a Gaussian limit as in 
Theorem 4.2, but we leave this to the reader. 

Theorem 4.4. Suppose that r>2 and <^p<^ nr^/"^ . Then Jo/{np x 



4.3. Adding edges. In the second dynamical model, n and a are given; 
we start with n vertices of which a are active, but no edges. We then add the 
edges of the complete graph Kn one by one, in random order. As in the pre- 
vious dynamical model, bootstrap percolation takes place instantaneously 
after each new edge is added. 

It is convenient to use the standard method of adding the edges at random 
times (as in, e.g., [31]). Thus, each edge e in Kn is added at a time Ue, 
where Ue are independent and uniformly distributed on [0, 1]. Then, at a time 
u G [0, 1], the resulting graph is Gn,p with p = u. (We use u to denote this 
time variable, in order not to confuse it with the time t used to describe the 
bootstrap percolation process.) 

Let the random variable M be the number of edges required to obtain 
a big active set A, where "big" is defined as in Section 4.1. 

Theorem 4.5. Suppose r >2 and a — t- oo with a = o{n). Then 




In particular, for 



r r = 2, we thus have Jq ~ npa^ = l/(2p). 




More precisely, 
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The proof is given in Section 9. 

Remark 4.6 (Coupling different p). The proof of Theorem 4.5 is based 
on using our earher results for a single p. We might also want to study the 
bootstrap percolation process for all p at once [or equivalently, in G{n,m) 
for all m at once], that is, with a coupling of the models for different p, for 
given n and a. As in Remark 4.3, this requires a careful choice of the order 
in which the vertices are inspected. We can achieve this by modifying the 
formulation in Section 2 as follows: 

When we have chosen a vertex ut, we reveal the times Utj that the edges 
from it appear; this tells us the neighborhood of ut at any time u. We begin 
by choosing ui,...,Ua as the initially active vertices. We then, after each 
choice Ut, t> a, calculate for each of the remaining n — t vertices the time 
when it acquires the rth edge to {wi, . . . , wj}, and let u^+i be the vertex such 
that this time is minimal. Then, fixing any time u = p, the chosen vertices ut 
will all be active until the first time that no unused active vertices remain, 
and the process stops. In this manner, we have found a choice of ui,n2, . . . 
that satisfies the description in Section 2 for all p 6 [0, 1] simultaneously. 

As in Remark 4.3, we can use this model to study, for example, the last 
"small" or the first "big" bootstrap percolation process, when we add edges 
one by one with a given set of initially active vertices. Again, we will not 
consider such questions in the present paper. 

5. Boundary cases. We have above assumed r > 2 and <C p <C n"^/*". 
In this section we treat the cases when these assumptions do not hold. Proofs 
are given in Section 11. 

We begin with the sparse case p{n) ^ c/n when and defined by (3.1) 
and (3.2) are of order n. (The exact values are no longer relevant, since 
they are based on approximations no longer valid.) This suggests that the 
interesting case is when a x n, that is, when a positive fraction of all vertices 
are initially active. Indeed, Theorem 5.2 shows that, w.h.p., if we start with 
a positive fraction of the graph, then the activation spreads to a larger part of 
the graph but does not reach almost all vertices; if, on the contrary, the size 
of the original set of activated vertices is negligible with respect to the size 
of the graph, then the activation does not spread to a positive fraction of the 
graph. Provided c is large enough, there is, as found by Scalia-Tomba [41], 
also in this dichotomy, or "phase transition," similar to Theorem 3.1, 

with a sudden jump from a "small" to a "large" final active set, although in 
this case all sets are of order n so the jump is not as dramatic as for larger p. 

Define, for x > 0, c > and 61 G [0, 1], 

f{x,c,e):={l- e)F{Po{cx) >r)+e-x 

(5.1) 
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j=r ^' 

(5.2) = 1 - X - {1 - e)F(Po{cx) < r - 1) 

(5.3) =i_^_(i_^)gl£^e— , 

j=o ■'■ 

and let xq{6) be the smallest root x > of 

(5.4) f{x,c,9) = Q- 

similarly, let xi{0) be the largest root in [0, 1] of this equation. 

Since /(O, c, 6*) = > and /(I, c, 61) = -(1 - 6')P(Po(c) < r - 1) < 0, there 
is always at least one root in [0, 1], and < xq{9) <xi{9) < 1; further < 
xo{e) < xi{e) < 1 when < 6* < 1 while xo(0) = and xo(l) = xi(l) = 1. We 
also define 



(Po(r-l)<r-2) _ Ei=o(^-l)Vj 



7-1 



(5.5) Cc - Cc(r) :- r + ^^^^^^ _ i) ^ ^ _ i) " ^ + _ ^y-i /(^ - 

Thus Cc(2) = 3, Cc(3) = 9/2, Cc(4) = 53/9. 

Lemma 5.1. (i) If < c< Cc, then (5.4) has a unique root x = xo(9) G 
[0, 1] for every 6 € [0, 1], and Xo{9) is a continuous strictly increasing func- 
tion of 9. 

(ii) If c> Cc, then there exists 9^ = 9^{c) and 9^ = 9^{c) with < 9^ < 
< 1 such that (5.4) has three roots in [0, 1] when 9 G {9~,9c), but a unique 
root when 9 G [0,9~) or 9 £ {9c, 1]; if 9 = 9~ > or 9 = 9c, there are two 
roots, one of them double. The smallest root xq{9) is strictly increasing and 
continuous on [0, 1] except at 9c where it has a jump from xq{9c) to xi{9c) > 
xq{Gc), where xi{9c) = X{i{9c+) ■= limg\g^ xq{9) is the other root for 9 = 0c- 
Furthermore, if 9 = 9c, then f{x,c,9) > for x G [0,xi{9)], and X(){9) is a 
double root. 

Theorem 5.2. Suppose that r >2, p ^ c/n and a ^ 9n for some con- 
stants c > and 9 >0. 

(i) If 9 = 0, that is, ifa = o{n), then A* / a ^-^ I . 

(ii) lfc = 0, that is, ifp=o{l/n), then A* /a ^^1. 

(iii) // < c < Cc, then A* /n — ^ xo{9), where xq{9) is the unique non- 
negative root of (5.4). 

(iv) If c> Cc and 9 ^ 9c{c) given by Lemma 5.1, then A* /n — ^ xq{9), 
where xq{9) is the smallest nonnegative root of (5.4). 

There is thus a jump in the final size at a = 9cn. Remark 11.1 shows how 
to find 9c. 
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Remark 5.3. ^c(c) and 9~{c) are decreasing functions of c. [9c{c) is 
strictly decreasing, while 0~[c) is constant for large c] Hence their largest 
value is, by the calculation in Remark 11.1, 

ei = di{r):=e,{c,) = e~{c,) 



rP(Po(r - 1) = r - 1) + P(Po(r - 1) < r - 2) ' 

Thus ei{2) = 1 - e/3, ^*(3) = 1 - eV9, 0*(4) = 1 - 2eV53. 

The threshold for 0~ = can be calculated too. For r = 2, 9~{c) = for 
c > e^/y, where = 1 + y + y^; numerically, this is c > 3.35091 .... 

Remark 5.4. We have here considered a given p~ c/n and varied a~ 
0n. If we instead, as in Theorem 3.4, take a given On for a fixed and vary 
c = pn, we have a similar phenomenon. Lemma 5.1 and Theorem 5.2 apply 
for every combination of 6 and c, and by considering the set of (c, 9) S 
such that (5.4) has two or three roots, it follows from Remark 5.3 that if 
9 > 9*, then A* /n -^-^ xq{c), where xq{c) is the unique root of (5.4) and 
thus a continuous function of c, while \i 9 <9*, then there is a range of c 
where (5.4) has three roots, and one value of c where the limit value xq[c) 
jumps from a "small" to a "large" value. Thus there is, again, a kind of 
phase transition. 

The following theorem shows that if we for simplicity take p = c/n, then 
the precise threshold for a in Theorem 5.2 (iv) is 9^n it 0(-y/n), with a width 
of the threshold of the order ^/n. 

Theorem 5.5. Suppose that r>2 and p = ^ with c > Cc fixed. Let 9^ = 
0c{c), xq = xo{9c) and xi = xi{9c) be as in Lemma 5.1; thus and xi are 
the two roots in [0, 1] of f{x, c, ^c) = 0, with xq < xi. 

(i) // (a — 9cn) / ^/n — — oo, then for any e > 0, w.h.p. A* <{\ + e)xon. 
If further a ~ 9cn, then A* = (1 + Op{l))x()n. 

(ii) // (a — 9cn) / ^Jn — +oo, then for any e > 0, w.h.p. ^* > (1 — e)xin. 
If further a ~ 9cn, then ^* = (1 + Op(l))xin. 

(iii) If {a — 9cn)/^/n ^ y £ (—00,00), then there exists a sequence En ^0 
such that 

F{A* G [(1 - £n)xin, (1 + en)xin]) $(y/a), 
FiA* G [(1 - £n)xon, (1 + en)xon]) ^ 1 - $(y/cj), 
where cr^ = (1 - 9c)tlj{cxo)/{l - Tp{cxo)) > 0. 

At the other, dense, endpoint of our range we have p{n) ~ cn~^/^. Then tc 
and Oc in (3.1) and (3.2) are of order constant. (Again the exact values are 
irrelevant.) This suggests, and the following theorem makes more precise, 
that the process will either die out or grow very quickly, with the outcome 
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determined by the first few steps, and that the activation can spread from 
a set of constant size to the entire graph with a positive probabihty, which, 
however, is bounded away from 1. 

Theorem 5.6. Suppose r > 2 and p ~ cn~^/^ for a constant c > 0. 

(i) // a is fixed with a>r, then 

(5.6) F{A* = n)^C{a,c) 

for some C('i)C) G (0,1). Furthermore, there exist numbers C,{a,c,k) > for 
k>a such that F{A* = k) ^ ({a, c, k) for each fixed k> a, and Yl'kLa C(o, c, 
k) + C{a,c) = 1. 

(ii) // o — ^ oo, then F{A* = n) — > 1, that is, A* = n w.h.p. 

Remark 5.7. The limiting probabihties in Theorem 5.6 can be ex- 
pressed as hitting probabihties of an inhomogeneous random walk. Let £ 
Po((r-i)'^'')' ^ > 1, be independent, and let Sk ■= Yl^=iiCj ~ 1) and T := 
min{fc : a + 5fc = 0} G N U {oo}. Then 

(5.7) C{a, c) = P(r = oo) = P(a + 5fc > 1 for all k>l) 

and C,{a,c,k) =P(T = k). Consequently, Theorem 5.6(i) can also be written 
as 

dTv(^*,min(f,n)) ^0, 
where (ixv is the total variation distance. 

If the probability of connections p is even larger, p ^ n~^l^ , then the 
initial set percolates as long as a > r. 

Theorem 5.8. Letr>2. If p':^ n~^/'^' anda>r, then A* = n w.h.p. 

Remark 5.9. The case r = 1 is different. In this case, infection is equiv- 
alent to activation, and spreads to every vertex connected to an active ver- 
tex. Thus the final active set is the union of the components of the graph 
that contain at least one initially active vertex. It is well known that this 
is equivalent to the Reed-Frost model for epidemics, where each infected 
person infects everyone else with probability p, all infections being indepen- 
dent. (This equivalence is easily seen by the argument in Section 2.) The 
Reed-Frost model has been much studied; see, for example, von Bahr and 
Martin-Lof [48], Martin-Lof [37, 38]. We state some known result for com- 
parison with our results for r > 2; proofs can be found in [37, 48], where also 
further details are given (including central limit theorems as in Section 3.2), 
or by modifying the proofs of the results above. Many results follow also 
easily from known results on the component structure of Gn,p- 



BOOTSTRAP PERCOLATION ON G 



21 



If p = logn/n + u}{n)/n, with oj{n) — )• oo, then w.h.p. Gn,p is connected 
and thus A* = n as soon as a ^ 0. More generaUy, if p ^ and a > 1, 
then w.h.p. A* = n — o{n); cf. Theorem 5.8. 

The case p = c/n is perhaps more interesting. There are many (w.h.p. 
> c'n) isolated vertices, so we cannot have percolation or almost percolation 
unless a/n— 7-1. If c>l, there is a single giant component of size pn + Op (n) , 
with p = p{c) > 0, and thus, if a > 1 is fixed, then there is a dichotomy, with 
either A* = o{n) or A* = pn + o(n) w.h.p., with probabilities converging to 
the positive (1 — p)" and 1 — (1 — p)'^, respectively; cf. Theorem 5.6. 

If c < 1 and a is fixed, then, by the same argument, A* converges to 
the total size of a Galton-Watson process with Po(c) offspring distribu- 
tion and a initial individuals (a Borel-Tanner distribution). Thus A* /a is 
stochastically bounded but does not converge in probability to a constant; 
cf. Theorem 5.2(i). 

If c < 1 and a — )• oo but a = o{n), then A* /a — ^ 1/(1 — c); cf. Theo- 
rem 5.2 (i). 

If p ^ c/n with any c > and a On with > 0, then A* /n — ^ xq{9), 
where xq is the unique positive root of (5.4), where now f(x,c,9) = 1 — 
X — (1 — 9)e~'^^ by (5.3) with r = 1. In other words. Theorem 5.2(iii) holds 
for r = 1, too, with Cc = oo, and there is no threshold. [For = there is 
the well-known threshold at c = 1, but note that also in this case, xq is 
continuous in both c and 6 and there is no jump as in Theorem 5.2(iv).] 

6. Overview of the proofs. By (2.3), (2.2) and (2.9), for n = 1, 2, . . . 

T>u min(^(t) - t) > 

t<u 

a + min(S'(t) - t) > 0. 

t<u 

Hence, A* = T is completely determined by the stochastic process 
mmt<:uiS (t) — t), u> 1. In particular, ^(0) percolates if and only if a > 
-inmt<:n{S{t) -t) = maxf<„(t - 5(t)). 

Note that (6.1) is an exact representation of A*; we have not yet made 
any approximations. To obtain asymptotic results, we introduce some simple 
approximations. We give an informal overview of the argument here; details 
will follow in later sections. 

First, S{t) RiES'(t) by the law of large numbers. A simple calculation 
will show that f{t) := KS{t) — t starts at for i = 0, then decreases to 
a minimum at i ~ tc given in (3.1), and then increases until E5(i) ~ n and 
thus f{t) Tun — t; then f{t) ^ n — t holds until t = n, so f{t) decreases again 
in this range to a final value f{n) = MS{n) — n ~ 0. 

There are thus two candidates for the minimum point of S{t) — t: either 
t w tc or t ~ n. What happens at t ~ n makes the difference between almost 



A*>u 



(6.1) 
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percolation and complete percolation; we will study this too in detail later, 
but for the moment we ignore it and concentrate on whether we have almost 
percolation or not, and we see that, roughly, this is determined by whether 
a > —{Site) — tc) or not, which can be approximated by o > — (ES'(tc) — ic)- 
A simple calculation yields E,S{tc) — —a^, which establishes the thresh- 
old at ac- 

This argument also gives the following picture of the course of the acti- 
vation A{t) in the critical case a~ ac- (We leave the modifications in the 
subcritical and supercritical cases to the reader.) We start with ^(0) = a. At 
first, there are very few new vertices that reach the threshold of r infections, 
and the number A{t) — t of unused vertices goes down, and approaches as t 
approaches ic- However, the rate of activation of new vertices is increasing, 
because a pool of vertices with r — 1 infections has accumulated, and as 
t^tc, new vertices are activated at about the same rate as they are used. 
There are now two possibilities: either the activation dies out at this point, 
with a total size about tc = r/{r — l)ac, or it survives this bottleneck, and 
it then rapidly grows after time tc until almost all vertices are active. In the 
latter case there are again two possibilities: either all remaining vertices are 
finally active (complete percolation), or a few are not. 

7. Approximation of S{t) by its mean. For notational convenience, we 
assume that Vn \ -4.(0) = {1, . . . , n — a}. Note first that (n — a)~^S(t) = (n — 
a)^^ ^^1=1 ^ t} is the empirical distribution function of {li}"""^. By the 
law of large numbers for the binomial distribution (2.10), for every t = t{n), 
S{t) = {n — a)TT{t) + Op{n). Moreover, by the Glivenko-Cantelli theorem ([35], 
Proposition 4.24), the following holds uniformly for all t: 

Lemma 7.1. supt>Q\S{t) -ES{t)\ = Op{n). 

Proof. If n — a> y/n, say, this is a weaker version of [35], Proposi- 
tion 4.24. For smaller n — a, the result is trivial, since < S{t) <n — a. □ 

For small t, the uniform error bound in Lemma 7.1 is not good enough. 
[It can be improved to Op(n^/^), see Lemma 7.3, but this too is too large for 
our purposes.] For each t, (2.13) gives a bound Op{{mT{t))^/^). We extend 
this to a uniform bound for a range of t by a martingale argument. We begin 
by introducing a pair of well-known martingales for empirical distribution 
functions. (See [31], Lemma 2.1, for a continuous time version.) 

Lemma 7.2. The stochastic process 




S{t)-ES{t) 
l-vr(t) 



t = 0,l,.... 
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is a martingale, and 

, S(t)-ES(t) 

(7.2) ^^ t = r,r + l,..., 

7r(t) 

is a reverse martingale. 

Proof. Since S{t) is a sum of n — a i.i.d. processes l{Yi < t}, it suffices 
to treat each of these separately, that is, for the first part to show that, for 
each i, 

Y(f^ - Y m - m<t}-ny^<t) _ l{Y,>t} 

is a martingale. This is elementary: ii Yi <t, then X(t) = X{t + 1) = 1. If 
Yi>t, then X{t) = —■K{t)/{\ — n{t)) either jumps to X{t + \) = 1 or decreases 
to X[t + 1) = — 7r(t + !)/(! — 7r(t + 1)), and the conditional probability of 
these events are (7r(t + 1) - vr(t))/(l - 7r(t)) and (1 - 7r(t + !))/(! - 7r(i)), 
respectively, so a simple calculation yields E(X(t + \)\Yi>t) = — 7r(t)/(l — 
7r(t)). [Alternatively, this follows from the case X[t) = 1 and the fact that 
EX(t + l)=EX(t).] Hence, E(X(t + l) \ X{1), . . . ,X{t)) = X{t). 

For the second part, we similarly find that X{t) := < i}/7r(t) is a re- 
verse martingale, that is, that E(X(t) | X{t + 1), . . .) = X{t + 1). □ 

Lemma 7.3. For any to, 

(7.3) E(sup|5(t) -E5(t)|)^ < 16n7r(to), 



(7.4) Efsup|5(t) -E5(t)|)^ < 16n(l-7r(to)). 



Proof. Assume first 7r(to) < 1/2. Let i(t) := {S{t) -ES{t))/{l -7r{t)). 
This is a martingale by Lemma 7.2, and Doob's inequality ([35], Proposi- 
tion 7.16) yields, using (2.13), 

E(sup|5(t) -E5(t)|)^ <Esup|^(t)P 
H<to ' t<to 

(T.5) ,4E|«,,P^4^^^ 

< 8n7r(to), 

which proves (7.3) in this case. Similarly, if 7r(to) > 1/2, then we obtain, 
using the reverse martingale (7.2), 

(7.6) e(sup \S{t) -ES{t)\Y < 4 ^^'^^^°^ < 8n(l - 7r(to)). 
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Now, let ti be the largest integer such that vr(ti) < 1/2. We can apply (7.5) 
with to = ^1 and (7.6) with to = ti + 1, and thus 

E(sup\Sit) -ESit)\Y <E(sup\S{t) -ES{t)\ 

+ e( sup |5(t) -E5(t)|)^ 
< 8n. 

This immediately implies (7.3) for 7r(to) > 1/2 and (7.4) for 7r(to) < 1/2. □ 

8. Approximation of ES'(t) and proofs of Theorems 3.1 3.4. For (real) 
t>0 and pt < 1, say, by (2.11), 



iP-(l + 0{pt + t-^)) 



[cf. (3.17)], and thus, by (2.12), 

4-T T 

(8.2) E5(t) - = Oinfp'^ipt + 1'^ + a/n)). 

It thus makes sense to approximate /(t) := E5'(t) — t by f{t) := n{tpY /r\ — t. 
An elementary calculation shows that / has, on [0,oo) a unique, global 
minimum at tc given by (3.1), and that the minimum value is /(tc) = —clq- 
We obtain, for example, the following estimate. 

Lemma 8.1. Suppose that r >2, <^p <^ n~^/^ and a = o{n). Then 

1 



sup 

0<a;<10r 



S(xtc) X^tc 

r 



■ Op(ic 



Proof. First, (8.2) and (3.4) yield, recalling (3.5), uniformly for x < lOr, 
ES{xtc) = nx''^(l + o{l/x)) = x''-{l + o(l/x)) = — tc + o(tc). 

ry] ry ly 

Further, Lemma 7.3 yields by (8.1) and (3.4), 
sup |5(xtc) - E5(xtc)P = Op(n^(10rtc)) = Op{nfy) = Op{tc) = Op(t^), 

0<x<10r 

and the result follows. □ 

We shall use Lemma 8.1 to prove now that in the subcritical case (a ~ aac 
with a < 1) there exists t <tc such that w.h.p. A{t) < t and then determine 
the precise value of A(T) = T. 
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Proof of Theorem 3.1(i). The assumption on a may be written 
by (3.2), 

(8.3) a = (a + o(l))ac = (a(l - r"^) + o(l))tc. 
Hence, (2.9) and Lemma 8.1, taking x = 1, yield 

A{tc) - tc = S{tc) + a - tc = tc/r + Op(tc) + a - tc 

= tc(r-' + a(l-r-i)-l+Op(l)). 

Since a(l - r'^) < 1 - w.h.p. A{tc) - tc < 0, and thus, by (2.2), T < t^. 

We apply Lemma 8.1 again, now taking x = T/tc, and see that S{T) = 
{T/tcY'tc/r + Op(tc). Since S{T) = A{T) -a = T-a,we find, using (8.3), 
that 

T-a{l- r~')t, = S{T) + o{U) = (J-J*-^ + Op(tc) 

and thus 

(8.4) r-_^r-l)a=l^-j +Op(l). 

Since the function h{x) := rx — x"^ is strictly increasing from to r — 1 on 
[0,1], (8.4) implies (using the fact just shown that T/t^ < 1 w.h.p.) that 

T/tc -!-)■ y, where y is the unique root in [0, 1] of h{y) = (r — l)a, that is, 
y = (/?(«) given by (3.11). 

This proves the first assertion, and if a > 0, the second follows. If a = 0, 
then a = o{tc), and (8.2) implies, for every fixed A > 0, E5(Aa) = 0{na^p^) = 
o{ant^~^p'^) = o(a). Hence, for every fixed A > 1, A{Xa) = S{Xa) + a = a + 
Op (a), so w.h.p. A{Xa) < \a, and thus a<T < Xa. Consequently, when a = 0, 

T/a^l. □ 

We turn to the proof of the supercritical case in Theorem 3.1. The follow- 
ing lemma shows that if the process of activation can escape the bottleneck 
at tc, then the process continues until (almost) percolation. The idea is to 
split the time interval [3ic,'^] into different intervals. Then in the proof of 
Theorem 3.1(ii) and (iii), it remains to show that if a is supercritical, then 
A{t) > t for t < 3tc. 

Let b* := bcUj{n), where uj{n) — t- oo slowly but is otherwise arbitrary. 

Lemma 8.2. Suppose that r>2 and ^ n~^/^ . Then, for any a, 

w.h.p. A{t) > t for all t G [3tc,n- b*]. 

Proof. By (2.9), A{t) = Sn-a{t) + a> S'„(t), so it suffices to show that 
Sn{t) > t (or equivalently, to take a = 0). We separate the proof into a num- 
ber of different cases for different ranges of t. We assume at some places, 
without further mention, that n is large enough. 
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Case 1: t G [3tc,8rtc]- By Lemma 8.1, w.h.p. for all such t, 

1 f t\' 3''^^ 3 

Sn{t)>-{j \ t,-t^>—-t-t,>-t-tc>t. 

Case 2: t e [8rtc,p~^]. Let tj := j > 1, and let J := min{j > 1: 

ptj > 1}. For tc <t < p~^, using (3.1), 

vr(t) > (J) /(I -p)*- = ^/e-*P(l +0(1)) 

-3~~3\Tc) r! ~3mV*^y 

Hence, for 3 < j < J — 1, E5'„(tj) = mi{tj) > ^tj > and thus, using 
Chebyshev's inequality and (2.13), 

F(S..(*,) < 2t,) < P(5„(*,, < ^E5„fe)) < ^(^l^^l^ i^^i^^ 
Hence, 

J-i 

IP(5'„(t) < t for some t G [8rtc,tj]) < ^P(S'„(tj) < 2tj) 

i=3 

12 2 
<>-< — < — = o 1). 

j=3 

Case 3: t G [p~^,cin] for a suitable small ci > 0. Let t[ := [p""*^]. Then 

7r(t;) = P(Bin(ti,p) > r) = P(Po(t'ip) > r) + 0(p) > 2ci 

for some small ci. Hence w.h.p. 5„(t'^) > cin and consequently Sn{t) > 
'S'n(^'i) > cin > i. 

Case 4: t G [cin, n — Let t2 L'^i'^J and t'^ :=n — p~^. Then 
l-7r(4)=P(Bin(i'2,p)<r) 

= 0((t^p)'-ie-*2?') = 0((np)^-ie-'=i"P) 
= o{{np)-^). 

Thus, E(n — 5'„(t2)) = n(l — vr(t2)) = o(p~^), and w.h.p., n — S'„(t2) < 
that is, Sn{t'2) > n — p~^ = t'^. 

Case 5: t £ [n — p~^,n — b*]. We have t'^ := n — p~^. Then 

l-7r{t'^)=¥(Bm{lt',\,p)<r) 

= Oi{t',pr-'e~''^n = 0{{npY-'e-^n 
= 0{bjn). 

Hence, E(n - 5.„(t^)) = n(l - -Kit'^)) = 0{bc) = o{b*), and thus w.h.p. n - 
Suit's) < b*, that is, Snit'^) >n-b*. □ 
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Remark 8.3. The proof shows that once we reach at least 1/p active 
vertices, the active set wiU w.h.p. grow to at least n — b* in at most 3 
generations. (Hence, the size then is n — Op(6c); see [33], Lemma 3.) 

Lemma 8.4. 



attained at x = 1 only. 

Proof. Elementary calculus. □ 

Proof of Theorem 3.1(ii) and (hi). For < t < 3tc, we may assume 
a < 3tc since otherwise A{t) > t trivially. In this case. Lemmas 8.1 and 8.4 
(with X = t/tc) show that w.h.p., uniformly in t < 3tc, 



This and Lemma 8.2 show that w.h.p. A{t) > t for all t <n — b*, and thus 
A*>n-b*. 

Hence n — A* <b* = bcUj{n) w.h.p., for any choice of uj{n) — t- oo, which 
is equivalent to n — A* = Op(&c); see, for example, [33], Lemma 3. This 
proves (ii). 

If be — 5- 0, we may choose b* = 1; then w.h.p. n — A* < 1, so A* = n. 
Conversely, if 6c A 0) then, at least for a subsequence, there exists with 
probability at least c > a vertex with degree < r — 1 , and with probability 
1 — a/n, this vertex will never be activated so A* < n; see Remark 3.3. This 
proves (iii). □ 



Proof of Theorem 3.2. Choose b* := npbc > 6c- By (3.3), b*p = 
{npY~^^ e~^P / {r — 1)! — )• 0. Hence, (n — b*)p = np + o(l) — )■ oo and 

1 - 7r(n - 6*) = P(Bin( [n - 6*J , p) < r - 1) 




A{t) =a + S{t) >{1 + 6){1- r~^)tc + - - - o(tc) 




> 6{1 - r~^)tc + ^tc - o{tc) > t. 



■c 



(n-6*)^-V-i 
(r- 1)! 

1 — 7r(n). 



(l-p) 



Consequently [see (3.7)], 

E{A{n) - A{n 

(8.5) 



b*))=E{S{n)-S{n-b*)) 
< n(7r(n) — 7r(n — 6*)) 
= o(n(l-7r(n))) = o(6;). 



28 



JANSON, LUCZAK, TUROVA AND VALLIER 



By assumption and Lemma 8.2, w.h.p. T > n — b* , and thus A{n — b*) < 
A{T) < A{n). Hence (8.5) implies 

(8.6) A*=T = A{T) = A{n) + Op{b'J. 
Further, 

(8.7) n — A{n) = n — a — S{n) G Bin(n — a, 1 — '/r(n)) 

with mean (n — a)(l — '/r(n)) b'^; see (3.7). 

If be — oo, then b'^ ~ b^; thus (8.7) imphes n — A{n) = b^ + Op(5c), and 
(8.6) yields (i). 

In 6c 6 < oo, then b'^ = 6c + o(l) b; thus (8.6) yields A* = A{n) + Op{l), 
and hence (since the variables are integer valued) A* = A{n) w.h.p. Further, 

in this case (8.7) implies n — A{n) Fo{b), and (ii) and (iii) follow. □ 

Proof of Theorem 3.4. An easy consequence of Theorem 3.1. □ 

We end this section with a proof of Theorem 4.4, where we start with 
a number of external infections (but no initially active vertices). As said 
in Section 4.2, we do this by a minor variation of our method. We include 
this proof to show the flexibility of the method, but we omit parts that are 
identical or almost identical to the proofs above. 

Proof of Theorem 4.4. In order to preserve independence between 
vertices, we consider the model with a Poisson number W S Po(^) of external 
infections (independent of everything else). Then each vertex i receives Wi G 
Po(^/n) external infections, and these random variables are independent. 
The analysis in Section 2 becomes slightly modified: the number of infections 
(marks) at time t now is M^{t) := Wi + Mi{t), so Yi is replaced by := 
min{t :Mf (t) > r} and S{t) is replaced by S^'{t) := ^.1=1 < We 

now have A{t) = Si'it), so A* =T = min{t > : S'^(t) = i}- 

We take /x = ynpa^ for a fixed y > and claim that if y < 1, then w.h.p. 
A* < tc/r and thus Jq > W, while if y > 1, then w.h.p. A* = n — Op{n) and 
thus Jo < W. The result then follows by taking y = 1 it e/2 for small e > 0. 

To prove these claims, we first note that ¥.S^{t) = nP(Aj['(t) > r) with, 
for such /i and t = 0{tc), 

P(Mf (t) > r) = F{Wi + Mi{t) > r) 

r-l 

(8.8) = ¥iWi = i)P(M,(t) >r-j)+ P(VFi > r) 

j=0 
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We obtain as in Lemma 8.1, using versions of Lemmas 7.2 and 7.3 for S^{t), 

Op(tc)- 



(8.9) sup 

0<x<10r 



5^(xtc)--(x + yoc/tc)"tc 
r 



Recall that ac/^c = 1 — 1/r by (3.2). If y < 1, then (8.9) with x = l/r implies 
that w.h.p. 

and thus A* = T < t^/r as claimed. 

Conversely, if y > 1, then Lemma 8.4 shows that 

{x + yajt^ ar(l \ ^, 1 
>a; + y— + --l =x + (y-l) l-- 
r \r J \ r 

Hence, (8.9) shows that w.h.p. A{xt^) = S^{xtc) > xt^ for x < lOr, and thus 
A* = T > lOrtc. Further, since S'^{t) > Sn{t), Lemma 8.2 implies that w.h.p. 
A{t) > t for all t G [3tc,n — b*], and thus A* > n — b* w.h.p., which proves 
the second claim and completes the proof. □ 

Note that (8.8) and (8.9) show that A{t) = S^'it) is, to the first order, 
E5„(t) shifted horizontally by fj,/{np) = yac, while in our standard model 
A{t) is St shifted vertically by a. Since we study the hitting time of the 
linear barrier A{t) = t, these are essentially equivalent. 

9. Proofs of Theorems 3.6-3.8 and 4.5. We begin with an estimate of vf (t) 
defined in (3.13). 

Lemma 9.1. Suppose thatr > 2 and <^p <^n~^^'^ . Then, for large n, 
nTxit) > lAt for t £ [3tc,n/2]. 

Proof. Assume not. Then we can find, for a subsequence n = nj^^ oo, 
t = tk £ [3tc,n/2] such that n7r{t) < lAt. Selecting a subsequence, we may 
further assume that pt^ z £ [0,oo]. We consider three cases separately. 

(i) z = 0, that is, pt 0. Then, from (3.13) and (3.4), 

(ptY r 3*^-1 3 

nirit) ~ n-^i-f- = ^ > 1 > -t. 

^ ' r\ rfc~^ ~ r "2 

(ii) pt^ z £ (0, oo). Then n7r{t) ~ nip{z) with ip{z) > 0, and t = 0{l/p) = 
o(n) <C nT\{t). 

(iii) z = oo, that is, pt — )• oo. Then nn^t) ~ n > 2t. 

In all cases we have for large n a contradiction to n7r(t) < lAt. □ 

Lemma 9.2. Suppose r > 2 and a — )• oo with a = o{n). Then pc and p* 
defined by (3.12) and (3.19) satisfy p* ~Pc- In particular, -^Pc ^n"^^"^ . 
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Proof. Let p = ypc for some fixed y > 0, and define and by (3.1) 
and (3.2). Then = y'^'-^'^/'' {r / {r - l))a and = ay'^'-^'^/'' . Further, 
<C p <C n"^/'' and, by (3.5), pt^ — )• and = o{n). Hence, if x = 0(1), 
and t = xtc, then pt = o(l), t = o{n) and, uniformly in bounded x, by (3.13) 
and (3.4), 

m=^^ + 0{{ptY^') = ^{l + o{l)). 
r! nr 

Hence, uniformly in x <3, 

(9.1) {n-a)Tr{t)-t= (^^+o{l)-x^tc. 

By Lemma 9.1, for large n, (n — a)7r(t) — t > for i e [3tc,n/2], and thus, 
by (9.1) and Lemma 8.4, 



(9.2) = |^i_i + o(l)^t, = -(l + o(l))ac 

= _(y-(-i)A + o(l))a. 

Hence, ify = l — 5< 1, then > 1 and thus, for large n, 'y{p) < —a so 

pI> p = {1 — 5)pc- Conversely, ify=l + (5>l, then (9.2) yields, for large n, 
lip) > -a so pI <p = (1 + 5)pc. 
Consequently, pl/pc — )■ 1. □ 

We also need more precise estimates of S{t). The following Gaussian pro- 
cess limit is fundamental. D[0, B] denotes the space of right-continuous func- 
tions on [0, B], with the Skorohod topology; see, for example, [16] (for B = 1; 
the general case is similar by a change of variables) or [35], Chapter 16. 

Lemma 9.3. Suppose r> 2 and a— t-oo witha = o{n). Then 

(9.3) Z{x) := ^(^^c)-E5(xtc) ^ ^(^r/^) 

in D[0,B] for any fixed B, where W is a standard Brownian motion. 

The conclusion, convergence in D[0,B] for every fixed B, can also be 
expressed as convergence in D[0,oo). 

Proof of Lemma 9.3. This is a result on convergence of empirical 
distribution functions (of {1^}); cf. [16], Theorem 16.4; we get here a Brow- 
nian motion instead of a Brownian bridge as in [16] because we consider for 
each B only a small initial part of the distribution of YJ. 
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For every fixed x > 0, by (8.1) and (3.5), n^xtc) ~ {xtcpY /r\ — ^ 0, and 
thus by (2.13) and (3.4) 

Var S(xtc) ~ mrlxtc) ~ ; — - = > oo. 

Hence (2.10) and the central hmit theorem yield Z{x) N{0,x^ /r) for 

every x > 0, which proves Z{x) W{x^ /r) for each fixed x. 

This is easily extended to finite-dimensional convergence: Suppose that 
< xi < • • • < X£ are fixed, and let lij := l{Yi £ {xj-itc, Xjtc]}, with xq = 0. 
Thus, S{xjtc) — S{xj^itc) = YllZi hj- Then, for l< j < i and k / j, 

Elij = TT{Xjtc) - TT^Xj^itc), 

\aTlij = EIij{l-EIij)^EI,j 
= Tr{xjtc) - 7r(xj_itc) ~ 

Cov{Iij,Iik) = - EhjEhk = 0{TT{x£t,f) = 0{{tJnf)=o{tJn). 

Note that {IijYj=i, i = 1, 2, . . . , n, are i.i.d. random vectors. The multi-dimen- 
sional central limit theorem with (e.g.) the Lindeberg condition (which fol- 
lows from the one-dimensional version in, for example, [28], Theorem 7.2.4, 
or [35], Theorem 5.12, by the Cramer- Wold device) thus shows that {Z{xj) — 

Z{xj^i))j^-^ {VjYj^i with Vj jointly normal with KVj =0, VarV^- = 

x^j/r — x'j_^/r and Cov{Vj,Vk) = for j / k. Hence, {VjYj^-^ = {W{x^j/r) — 

W{x'^_,/rY=iY, and thus {Z{xj)Y,=i ^ {W{x-j/r)Yj^,. 

To show (9.3), it thus remains to show tightness of Z(x). We use [16], 
Theorem 15.6, with 7 = 2 and a = 1 (an alternative would be to instead use 
Aldous's tightness criterion ([35], Theorem 16.11)); it thus suffices to prove 
that, for every xi,X2,X3 with < xi < X2 < a^s < -B, and some constant C 
depending on B but not on n or xi,X2,X3, 

(9.4) n\Z{x2) - Z{XI)\^\Z{X3) - Z{X2)\^} < C{X3 - Xif. 

With the notation above and I[j := lij — E/jj, the left-hand side of (9.4) can 
be written 

n—a n—a 
i,j,k,l=l i,j,k,l=l 

By independence, the only nonzero terms are those where i,j,k,l either 
coincide in two pairs, or all four indices coincide, and it follows easily that 
(for any i) 

(9.5) E{|Z(X2) - Z(xi)|2|Z(x3) - Z{x2)\''} < 3t-\n - af¥.h2EIa. 
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Further, since each Yi is integer- valued, the left-hand side of (9.4) vanishes 
unless there is at least one integer in each of the intervals {xitc,X2tc] and 
{x2tc,X3tc], which implies that x^tc — xitc > 1, so we only have to consider 
this case. It follows from (2.7) that for m < x^tc < Bt^, 

^r— 1 Tjr— lj.r— 1 -r Rr— 1 
P 1^ =m < —f < ^ J" = 

(r — 1)! (r — Ij! n 

and thus, assuming xstc — xitc > 1, 

j^r—l B^~^ 
E/i2 + E/i3 < ([Xgtcj - [xitcD < {X3tc-Xit, + 1) 



< 2{x3tc — Xitc 



n n 



n 



Consequently, (9.5) yields. 



E{|Z(X2) - Z{xi)Y\Z{x3) - Z{X2)V] < 3-^ 2(x3tc - x^t 



= l2{x,-xifB'^^-^\ 
which proves (9.4) with C = 12fi2(r-i). The proof is complete. □ 

We also need a more careful estimate of 7r(t) than above, and we use the 
corresponding Poisson probability 7r(t) defined in (3.13). 

Lemma 9.4. Assume <^p <^ n^^l''' . Uniformly for t > 1, 7r(t) = 
7f(t)(l + 0{t~^Y). In "particular, uniformly for t < 3tc, 

7r(t) = n{t) + 0{{ptY/t) = n{t) + 0{n{tc)/tc) = n{t) + 0{n~^). 
Proof. Assume first pt < 1. By (2.11), 

7r(t) = F{Bm{t,p) = j) = p^^lj^l + O (^^^^ (1 - :p)*+'^(^) 



oo 



= E ^e-P*+0(*^'^)(l + 0{f/t + jp)) 

j=r ^' 

= n{t){l + 0{tp^ + t~^+p)) = 7^(t)(l + 0(t"^)). 

For pt > 1, Tr{t) is bounded below, and the result follows from (3.15). 

If t < 3tc, then t = 0{tc) = o{l/p) by (3.5), and thus, using (3.13) and (3.4), 

7r(t) - n{t) = Omt)/t) = Oiipty/t) = 0{{ptj/tc) = 0(l/n). □ 

Proof of Theorem 3.6. It suffices to consider a such that a ~ Oc = 
(1 — r~^)tc- It then follows by (2.9) and Lemma 8.1 that, uniformly for 
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X < lOr, 

A{xtc) — xtc = a + S{xtc) — xtc = ac H — x^'tc — xt^ + Op(tc) 
(9.6) 1 \ 

By Lemma 8.4, the coefficient 1 — + x^'/r — x equals at x = 1 but is 
strictly positive for all other x > 0. It follows that for every (5 > 0, w.h.p. 
A{xtc) — xtc > for all x £ [0,1 — 6] U [1 + 6, lOr]. By a simple standard 
argument, there thus exists a sequence 5„ — )• 0, where we may further assume 
that 6n > \tl/tc — 1| , such that w.h.p. A{xtc) — xtc>0 for all x £[0,1 — 6n] U 
[1 + Sn,10r]. 

Hence, w.h.p. either T £ [{l-6n)tc, {1 + Sn)tc], or A{t) > t for ah t < lOrtc, 
in the latter case, for any b* ^ be, w.h.p. A{t) > t for all t < n — 6* by Lem- 
ma 8.2, so T > n — b*; hence T = n — Op (6c) and, more precisely, provided 
a = o(n). Theorem 3.2 applies. 

We thus only have to investigate the interval [(1 — 6n)tc, (1 + 5n)ic] more 
closely. By the Skorohod coupling theorem ([35], Theorem 4.30), we may as- 
sume that the processes for different n are coupled such that the limit (9.3) 
holds a.s., and not just in distribution. Since convergence in D[0,B] to 
a continuous function is equivalent to uniform convergence, this means that 
(a.s.) Z{x) — )• W{x^ /r) uniformly for x < B; in particular, uniformly for 
x£ [1 - 5n,l + 6n], 

S{xtc) = (n — a)7r{xtc) + tl^"^ Z{x) 

(9.7) 

= (n - a)TT{xtc) + tl^\w{l/r) + o(l)). 

Let ^ := W{l/r) £ N{0, 1/r). Then, by (9.7) and Lemma 9.4, uniformly for 
x£[l- 5n,l + 6n], 

Sixtc) = (n - a)#(xtc) + 0(1) + tl^\C + o{l)) 

(9.8) 

= (n-a)#(xtc)+ty^e + o(l)) 
and thus, refining (9.6), 

A{xtc) — xtc = a + S{xtc) — xtc 

(9.9) 

= a + {n — a)TT{xtc) — xtc + ^c^^C + Op{t\^'^). 
Hence, recalling (3.16) and that the minimum there is attained at t* £ [(1 — 

6n)tc,{l + 6n)tcl 

A{t) - [t\ 
mm -— 

iG[(l-<5„)ic,(l+'5n)ic] l-7r(*) 
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(9.10) =a+ min ^^^^ + + Op(ty^) 

= a-a* + ty'e + Op(iy'). 

We have shown that w.h.p. A* = T < {1 + Sn)tc if and only if this minimum 
is < 0, and otherwise T = n — Op(6c)i and the results follow; for (i) we also 
observe that (9.9) and (9.10) imply that w.h.p. ^(t*) — t* < and thus 
T < t*. For example, in (iii) we have 

a-a* + tl'^i + Op(tJ/') = yal'^ + tl'^i + o^{tl'^) 

= ((^_l)l/2y + ^l/2^ + Op(l))(^c/r)l/^ 
and the probability that this is positive tends to 

P((r - if'^y + > 0) = ^>((r - if^y), 
since r ^/2^GiV(0,l). □ 

Proof of Theorem 3.7. It suffices to consider p ~ Pc ~ Pc, which 
implies that Oc = ac{p) ^ adPc) = a. Hence the arguments in the proof of 
Theorem 3.6 apply. In particular, again it suffices to consider t £ J = Jn '■= 
[(1 — 5n)tc, (1 + 6n)tc], where now tc = tc(Pc) = (r/(r — l))a. The infimum 
in (3.18) is attained for some t = t**, where by Lemma 9.1 t** < 3tc for 
large n, and an argument as in (9.6) shows that t** ~tc. We may assume 
that 6n is chosen such that t** £ J. Then, by (9.9), 

(9.11) min{^(t) - [t\} = a + mm{{n - a)fr{t) - t} + tl^'^^ + Op{tl^'^), 

where, by (3.18) and the comments just made (for large n), 

(9.12) a + min{(n - a)n{t) -t} = a + jip) = -jip*) + 7(p). 

Further, writing (3.18) as 7(p) := mmt{F{tp) — t}, with F{x) = (n — a)Tp{x), 
we have at the minimum point t := t** the derivative pF'{pt**) — 1 = 0. 
Hence, uniformly for \pi — p*| < ep* and \t — tc\ < etc, for any e = e„ — 0, 
using (3.14), F'ipit) = (1 + o(l))F'(pt**) = (1 + o(l))/p* and thus by the 
mean- value theorem, for some pi between p and p*, 

F{tp) - F{tp*) = tip - p*)F'{tp,) = tc^^il + 0(1)). 

Since the minimum in (3.18) may be taken over such t only, for suitable e^, 
this yields 

7(p)-7(Pc) = tc^(l + o(l)). 

Pc 
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Consequently, (9.11) and (9.12) yield 

min{^(t) -lt\}= tc^-r^(l + 0(1)) + iJ'C + op(ty'). 

Hence, 

F(mm{A{t) - [t]} > = p(^ic/^^^ + ^ > 0^ + o(l) 

=p(-r^/^e<(rtc)^/^^)+o(i), 

where r^/^^ G -^(0, 1) and = fr^o, and the different parts of the theorem 
follow. □ 

Lemma 9.5. Suppose that r > 2 and <C p ^ n~^^''' . Then, for large n 
at least, the minimum point t* in (3.16) is unique, and ~ tci flc ~ ^c! rnore 
precisely, 

(9.13) t* = (^l + -^ + o(ptc))tc, 

(9.14) < = (l - i + + o(pt.)) = (l + ^ + o(ptc)) ac 
Proof. Let 

, , nnit) — t n — t 

Then 

^ -(l-ff(t)) + (n-t)ff^(t) ^ (n-t)7^^(t)+7^(t)-l 
(l-vf(t))2 (l-vr(t))2 

Let 

(9.16) /i(t) := (1 - ^{t)fg'{t) = {n- t)n'{t) + n{t) - 1. 

Then h'{t) = (n — t)'fr"(t) > for t < (r — 1) /p, and in particular (for large n) 
for t < Stc] see (3.5). Further, h{0) = — 1 and by (3.14) and (3.1), for large n, 

/i(3tc) = 3'-^-l + o(l)>0; 

hence, there is a unique t* G [0,3tc] such that h{t*) = 0, or equivalently 
g'{tl) = 0. Further, g"{tl) = h'{t*)/{l - 7f(t*))2 > so t* is the unique mini- 
mum point of g{t) in [0,3tc], as we defined t* after (3.16). 
Let x = t*/tc £ [0,3]. Then, by (3.14), (3.1) and (3.17), 



o = .K) = (i-|).'-'.-'--+o(£)-i 



n 



36 JANSON, LUCZAK, TUROVA AND VALLIER 

Hence, recalling <^ p and pt^ — > 0, 

(9.17) X = eP*^*/(^-i) (^1 + O (^^^ ^ = 1 + + o{pQ. 

In particular, x = l + o(l), so t* ~ to and (9.13) follows from (9.17). Fi- 
nally, substituting (9.13) in (3.16) yields, using (9.15) together with 7r(tc) = 
0{tc/n) = o{ptc) by (3.17) and ^P, and also (3.13) and (3.17), 

I = _s|> . (1 + o(,pt.))'l^fSl = (1 + „(p,,,)(. - 

= (1 + o(ptO) (x - (l + + o(pt^)) ) 



= (1 + o{pt,)) X - —e-P'^ 1 + ^ + o(ptc; 
y r y r + 1 

= a;-— (l-ptc+^ ) +o(ptc) 
r \ r + 1 y 

= X - — + - ( ptc - ) + o(p*c), 

r r \ r + 1 / 

and (9.14) follows by (9.17) and x - x" jr = 1 - 1/r + 0(x - 1)^. □ 

Proof of Theorem 3.8. In case (i), that is, when a < 1, by Theo- 
rem 3.1(i), 

(9.18) r = ^* = (99(a) +Op(l))tc. 

By Theorem 3.6(i), this holds as well in case (ii), that is, when a = 1 and, 
correspondingly, 99(0) = (/j(l) = 1. Thus, for any a < 1 there exist (5„ — )■ 
such that w.h.p. T G /„ := \{^p{a) — bn)t^\^p{a) +(5n)ic]- As in the proof 
of Theorem 3.6, we may, by the Skorohod coupling theorem ([35], Theo- 
rem 4.30) assume that the limit in (9.3) holds a.s., uniformly va. x<B. For 
t G Iri; i/^c —5" ¥'('^)> and (9.3) then implies that, uniformly for t G /„, 

5(t) =E5(t) +ty^Z(Vtc) 

(9.19) 

Let i := W{}p{aY jr) G iV(0, (/?(Q)7r). Then, by (9.19) and Lemma 9.4, for 

5(t) = (n-a)7f(t) + ty'(e + Op(l)). 
Since w.h.p. T ^ 1^, we may here substitute t = T, and obtain 
= A{T) - T = a + S{T) - T 

(9-20) 

= a + (n - a)7f (T) - T + ty'(^ + Op(l)). 
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Define the function g{t) by 

(9.21) g{t):=a + {n-a)7T{t)-t; 
tiius (3.22) is g{t^) = 0. Then we have shown in (9.20), 

(9.22) ~g(^T) = -tl/\c + Opil)). 

The function g is continuous on [0,oo) with ^(0) = a > 0. Consider the 
two cases separately. 

(i): WhenQ< 1 we have, by (9.21) and (3.17), g{tc) = a + (1 + o{l))tJr - 
tc = a — (1 + o(l))ac < (for large n), since a ~ aac- Further, on [Ojic], 
using (3.14) and (3.1), 

~g'{t) = {n- a^it) - 1 = ("l V \-P^ - 1 

(9.23) 

-i + o(i); 



this is negative for t < (1 — e)tc for any e > and large n, and it follows that 
(for large n, at least), g has a unique root in [0,tc]- It follows from (3.17) 
and (3.11) that t*/tc — fia)- 

Since also T/t^ — ^ ^(ct), (9.23) implies that g'{t) = —(1 — ip{aY~^) + 
Op(l) for all t between and T, and thus the mean value theorem yields 

giT) = g{T) - ~g{U) = (T - t,)(-(l - ^{af-^) + Op(l)), 

which together with (9.22) yields, recalling 99(a) < 1, 

T-u = -{{i- ipiay-'r' + opimiT) = ((1 - ^{ay-'r'c + opm'j'. 

The result in (i) follows. 

(ii): Let g{t) := g{t) / {1 — 7r{t)) — a and h{t) be as in the proof of Lemma 9.5, 
(9.15) and (9.16). We know that g{t*) = -a* and g'{tl) = 0. Further, for 
t~tc, we have by (3.5), (3.13), (3.17) and (3.14), 

7r(t)~ — — ~ — = o(l), 
r! rn 

r n 



t ntc 

Hence, by (9.16), /i(t) = o(l), /i'(t) = (n - t)7r"(t) ~ (r - l)/tc and 

/i^(t) , ^ hit)n'{t) 



(l-^r(t))2 (l-7r(t))3 
(9.24) 

" ^-(l + o(l)) + o(i)=^(l + o(l)). 
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Consequently, a Taylor expansion yields, for t ~ tc ~ ^c) 
(9.25) g{t) = -a* + - t*f{l + 

We have g{t^) = and thus g{t^) = —a. Further, (3.17) and (3.11) again 
yield U/t^ (p{l) = 1. Hence, (9.25) yields (3.23). 

Since Theorem 3.6 yields r = ic(l + Op(l)) and T<t* w.h.p., (9.22) yields 
g(T) = giT)/{l - n{T)) -a = -a- tl^^^ + Op(l)); thus, similarly, (9.25) 

1/2 

yields, using a* — a ^ tc , 



2tr . . 1/2, 



- T = (1 + Op(l)) W —^(a* - a - t','\i + Op(l))) 



= (1 + o^{l))^l^^(^) = (1 + Op(l))(t* - U). 

Hence, w.h.p., every t between T and t^, satisfies t* — t = {l + o(l))(t* — t*), 
and then by (9.24), 

g'{t) = (1 + o(l))^(t - tl) = -(1 + o(l))^(t* - U). 

Finally, the mean value theorem yields, similarly to case (i), 

g{T)-g{U) _ tV\i + o^{i)) 



T-U 



_(1 + o(l))((r - l)/t,){tl - U) (2(r - l)t~\al - a))i/2 ' 
and the result in (ii) follows, since ^ G -^(0, 1/r) and a* ~ Oc = ^^ic- D 

Proof of Theorem 4.5. We use the version described in Section 4.3 
where edges are added at random times. Let U be the time the active set 
becomes big, that is, the time the Mth edge is added. For any given p, then 
U <p \i and only if at time the active set is big, which is the same as 
saying that there is a big active set in Gn,p- Fix x G (— cxd,oo) and choose 
p = p* + (r — l)^/^r~^xa~^/^pc- Then Theorem 3.7 [with A = (r — lY/'^r~^x\ 

yields P([/ < p) <I>(x). In other words, {U -pl)/{{r- l)^/^r~^a"^/^pc) 
iV(0,l), or 

(9.26) f/G AsN(p*,(r- l)r-2a-Vc)- 

Let N{u) be the number of edges at time u. Then A^(0) = and, in analogy 
with Lemma 7.2, {N{u) - (2)^^)/(l 

— u) is a martingale on [0,1). Thus, 
Doob's inequality yields, as in the proof of Lemma 7.3, for any uq G [0, 1], 
E(sup„<„jA^(ii) - {^u\^) < 16(2)^0 = 0{n'^uo); cf. [31], Lemma 3.2. Hence, 



sup 

u<uo 



N{u) - ( ^ ) u 



: Op{nuQ 



1/2. 
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Choosing uq = 2pc, we thus obtain, since U < 2pc w.h.p., 



(9.27) M = N{U) = i^'^jU + Op{np'c 

We have by (3.12), for some constant c = c(r), 

-c - =o(l). 



Consequently, the error term in (9.27) is Op{n'^Pc/a^^'^), and the result follows 
from (9.27) and (9.26). □ 

10. The number of generations. Let Tq : = and define inductively 

(10.1) T,+i:=^(T,), j>0. 

Thus A{Tq) = A{<d) = 1^(0)1 = l^ol, the size of generation (the initially ac- 
tive vertices). Further, by our choice of ut as one of the oldest unused, active 
vertices, Z{Ti) = Z{A{Q)) = A{Q) = Go and ^(Ts) = Z{A{Ti)) = A{Ti) = 
QqU Gi; in general, by induction, all vertices in generation k (and earlier) 
have been found and declared active at time T^, and they have been used 
at time T^+i = A{Tf;). In other words, 

k 

[jgj=A{Tk)=Z{Tk+i), k>0. 

j=0 

In particular, the size of generation k equals 

IGkl = \Z{Tk+i) \ Z{n)\ = Tfc+1 - Tfc, k> 0, 
and the number of generations r defined by (1.1) is 

r = max{/c > : T^+i > T^} = mm{k > 1 : T^+i = T^} - 1. 

We begin by considering the supercritical case. We then consider the 
spread of activation in the bootstrap percolation process in three different 
stages in each of the following subsections. We first consider the bottleneck 
when the size is close to tc] we know that this is where the activation will 
stop in the critical case, and in the slightly supercritical case, the activation 
will grow slowly here, and this will dominate the total time. Then follows 
a period of doubly exponential growth, and finally, when there are only o(n) 
vertices remaining, it may take some time to sweep up the last of them. 
Recall that Example 3.11 shows that each of the three phases may dominate 
the two others. 

We define, for any m<n, 

(10.2) T(m) := inf{j : Tj > in} 

with the interpretation that r(m) = oo if this set of j is empty, that is, if 
m>A* = T. 
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10.1. The bottleneck. We consider first r(3tc), that is, the number of 
generations required to achieve at least 3tc active vertices. [The constant 3 
is chosen for convenience; any constant > 1 would give the same result within 
0(1) w.h.p.] In the really supercritical case, this is achieved quickly. 

Proposition 10.1. Suppose that r>2 and n^^ <^p<^ n~^/^. Assume 
a > (1 + 6)ac for some 5>0. Then, w.h.p. T(3tc) = 0(1). 

Proof. Lemmas 8.1 and 8.4 imply that uniformly for < t < 3tc, with 
X = t/tc, 

A{t) -t = S{t) -t + a= ^-x"" -x^tc + a + Op(tc) > -Oc + (1 + <5)ac + Op(ac) 
and thus w.h.p. 

Ait)-t>-a,>-t,. 

Hence, in this range, w.h.p. each generation has size at least (5/4)tc, and the 
numbers of generations T(3tc) required to reach 3tc is thus w.h.p. bounded 
by 12/(5. □ 

In the slightly supercritical case when a ~ Oc, this part may be a real 
bottleneck, however. We will approximate A{t) by deterministic functions 
and begin with a definition: given a function F : [0,oo) — )• [0,oo), define the 
iterates Tf^^ := F{Tf) with T^ := 0. Thus Tj = T/. 

Lemma 10.2. If A< F, then Tj < Tf for every j. If A> F, then 
Tj > T^ for every j . 

Proof. By induction. Assume, for example, A<F and Tj < TJ^ . Then, 
since A is (weakly) increasing, 

r,+i = A{Tj) < A{Tf) < F{T[) = Tf.,,. □ 
We next prove a deterministic lemma. 

Lemma 10.3. Let a,b,to> 0, and let F{t) ■.= t + a + b{t -tof . Assume 
a <t(j and bto < 1 . Let N be the smallest integer such that T^ > 2tQ . Then 

N={l + O{bto)) r ^_dx + 0(l). 

Proof. Assume that t G [0, 2to] and let A := F{t) — t. The assumptions 
on a and b imply < A < a + bt^ < 2to. For s G [t,t + A] we have \F'{s) - 
1| = \2b{s — to)\ < 6bto, and thus, by the mean-value theorem, \F{s) — s — 
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{F{t) -t)\< GbtoA = 6bto{F{t) - 1). Thus, uniformly for such s, F{s) -s = 
{F{t) - t){l + O{bto)) and thus - t)"^ = {F{s) - sy^{l + 0{bto)). 

Consequently, 

rt+A ^ 

= (l + 0(6to))/ —^ds. 

Jt a + b[s-tQY 

If t = Tf, then t + A = F{t) = Tf^^. Summing for j = 0, . . . , - 1 we thus 
obtain 

N = il + O{bto)) — 2ds>(l + 0(6to)) / — -T^ds, 

Jq a + 6(s-to) Jo a + o(s-to) 

and similarly, omitting j = — 1 , 

AT _ 1 < (1 + O(6to)) / —2 d^. 

Jq a + b{s-toy 

The result follows, using the change of variable s = x -\-tQ. □ 

Proposition 10.4. Suppose that r>2 and n^^ <^p<^ n"^/**. Assume 
a/oc — )■ 1 and a — a* » -v/oT- Then, 



^/r — I \a — a, 

Proof. By (8.1) and (3.4), n7r(3tc) = 0(tc). Hence, by Lemmas 7.3 
and 9.4, for t < 3tc, 

(10.3) S{t) =ES{t) + Op(ty^) = (n - a)n{t) + Op{tl^^). 

Let H{t) := o + (n — a)7r(t) — t and define h := infj<3t^ H{t). Let the infi- 
mum be attained at t*; it follows from (3.17) and Lemma 8.4 that t^ ~ tc', 
cf. (9.2). We have H{t^) = h, H'{t^) = and, uniformly for t < 3tc, us- 
ing (3.14), (3.5) and (3.4), 

H"{t) = {n- a)n"it) = (n - " p) e"^* 

rJl^t 1^ rn^ V- ^-l + o(l) 

^-^^ + o(l) + ofl^-^^ 



Hence, by a Taylor expansion, for < i < 3tc, 
(10.4) H{t) = h + ^^{t-t,)^(l + o{l)+0 



my 
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Notice that in the last two formulas, the term o(l) tends to as n— t-oo, 
uniformly in t < 3tc, and 0{- ■ ■) is uniform in n; these uniformities allow us 
to combine the two terms in a meaningful way. 

On the interval [0,3tc], ^{t) = o(l) by (3.17) and (3.5), and thus by (3.16) 

, . , H{t) . „ a + {n - a)^{t) - t 
n ~ mi — = mi 



t<3tc 1 - Tvit) t<3tc 1 — 7r(t) 

(10.5) 

. mrjt) - 1 

= a + mi -— = a — Or . 

t<3tc 1 — 7r(t) 

1/2 

In particular, by our assumption, /i » Oc . Consequently, by (10.3) and 
(10.4), for any fixed small e > and \t — tc\ < 2etc, w.h.p. 

A{t) -t = a + S{t) - t = H{t) + Op(/i) 

(10.6) 

= (l + 0{e))(h+'-^{t-uf 

Let ti := (1 - £% ~ (1 - e)tc and t2 := (1 + e)t* ~ (1 + e)tc- For < t < ti 
and t2 <t < 3tc, Lemmas 8.1 and 8.4 imply that w.h.p. A(t) — t> etc, for 
some constant c = c(e) > 0. The numbers of generations required to cover the 
intervals [0,ti] and [t2,3tc] are thus 0(l/c(e)), so r(3tc) = + 0(l/c(e)), 
where is the number of generations needed to increase the size from at 
least ti to at least t2- To find r^, we may redefine T„ by starting with 
To := ti and iterate as in (10.1) until we reach t2- (Note that since A is 
increasing, if we start with a larger Tq, then every r„ will be larger. Hence, 
to start with exactly ti can only affect by at most 1.) By (10.6) and 
Lemma 10.2, we may on the interval [ii,t2] w.h.p. obtain upper and lower 
bounds from F±{t) =t + {l±Ce){h + b{t-t^)^), where 6 := (r-l)/(2tc) > 
and C is some constant. Let to := t^ —ti = et* > 0. We have a* ~ Oc and by 
assumption a ~ Oc, so by (10.5), h = o(ac) = o(tc) and thus h < to/2 for 
large n. Furthermore, bto = 0{et^/tc) = 0{e). If e is small enough, we thus 
have bto < 1/2 and, by a translation t t — ti, Lemma 10.3 applies to 
both and F_ and yields, w.h.p., using (10.5), 
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Since tc/{a — a*) — )■ oo, it follows that for every e > 0, w.h.p., with c'(e) := 
min(c(e),e) > 0, 

r(3tc) = + 0(l/c(e)) = (1 + 0(e))^^ + 0(l/c'(e)) 



7r^/2 + 0(e)/ t 



1/2 



•y/r - 1 V ~ 
The result follows since e > is arbitrary. □ 

Remark 10.5. In the critical case (a — a*) j ^fal — ?• y G (— oo, oo), we can 
use a minor variation of the same argument, now using Lemma 9.3, where 
/i ~ a — ttc above is replaced by the random 

= a - ac + V(l/r) + o^{^J'') = r'^/h'J\yV^ + ^ + Op(l)), 

where ^ ~ iV(0, 1). We have T(3tc) < oo A* > 3tc /i' > 0; this is 

w.h.p. equivalent to yy/r — 1 + ^ > 0. [This thus happens with probability 
$((r — l)^/^y) + o(l), as stated in Theorem 3.6(iii).] The argument above 
then shows that conditioned on r(3tc) < oo (i.e., on A* > 3tc), 

r{3t.)/t\" A (^^^(e + yV^r'" I e + yV^i > o 

In particular, then r(3tc) = Qp{t\^^). 

Note that in the supercritical case in Proposition 10.4, the time r(3tc) is 
always smaller than t^^, but that it approaches the order t\^^ when a — a* 

1/2 

grows only a little faster than the critical value Oc . Hence, we can say that 
the worst possible number of generations to pass the bottleneck at tc is of 
the order t^^. 



10.2. The doubly exponential growth. We next consider the growth from 
size 3tc up to 1/p. We will show that in this range, the growth is doubly 
exponential. Again, we approximate A{t) by deterministic functions. 

Define for any 5 G E [cf. (3.4)], 

(10.7) Fs{t):=J^{l + 6) = (^£l^ '^(1 + ^)- 

Lemma 10.6. For every 6 > 0, there are positive constants e and K such 
that w.h.p. F^s{t) < A{t) < Fs{t) for all t £ [K{tc + a),e/p]. 



44 JANSON, LUCZAK, TUROVA AND VALLIER 

Proof. By (8.1) and (10.7), for Ktc <t< e/p (with e < 1), if n is large 
enough so tc>l, 

vr(t) = ^(1 + 0{e + K-')) = -Fo(t)(l + 0{e + K~')). 
r\ n 

We may thus choose e and K such that for ah such t (and large n) 

(10.8) F^s/A{t)<m:{t)<Fs,^{t). 
For t > K{tc + a), (10.7) implies 

\tcj r r 

so choosing K large enough, we have a < {5/4:)F(){t) for all t £ [K{tc + a), 
e/n], and thus by (10.8) 

F^s/i{t) - a < E5(t) = (n - a)Ti{t) < Fs/^{t) < Fs/2{t) - a. 

Hence, by Chebyshev's inequality, using (2.13) and (10.8), 

P{A(t)^[F_3,/4(t),F35/4(t)]} 

(10.9) =F{S{t) i [F_35/4(t) -a,F3,/4(t) - a]} 

^ mrit) ^ Fs/i{t) _ 16(1+V4) 
- (5Fo(t)/4)2 - (5Fo(t)/4)2 - 6^Fo{t) ' 

Define tj := (1 + 6/by/''K{tc + a). Then, (10.9) and (10.7) show that, 
assuming as we may 5 <1, 

20 

(ii),^35/4(ii)]} < X] ^2p U.\ 
j>0:tj<e/p j>0 ■'^ 

20 

^ 100(1 + V5) 

<53Fo(to) ' 

since, using (10.7) again and (3.5), 

Foito) = Fo{K{tc + a)) > Fo(tc) = - ^ oo. 

r 

Consequently, w.h.p. A{tj) E [-^-35/4(^^)5 -^35/4 (ij)] for all j > with < 
e/p. However, if < t < tj+i, then Fo{tj) < Fo{t) < Fo{tj+i) = {l + 6/5)Fo{tj) 
and it follows that, since both A{t) and FQ{t) are monotone, w.h.p. 

(1 + 5/5r'F^,s/4it) < Mt) < (1 + S/5)F,s/4{t) 

for all t G [K{tc + a), (1 + 5/5)~^/''e/p], which, provided 6 is small and e is 
replaced by e/2, say, yields the result. □ 
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Proposition 10.7. Suppose that r>2 and n ^ <^ p <^ n ^1"^ . Then 
w.h.p., when A* > St^, 

t{1/p) -r(3tc) = loglog(np) -log+log — ) +0(1). 

Proof. Choose a fixed < 6 <l, and choose e and K as in Lemma 10.6. 
(In this proof, we do not have to let 5 — >• 0, so we can take 6 = 1/2, say.) 
First, T{K{tc + a)) — r(3tc), the number of generations from 3ic to K(tc + a), 
is w.h.p. 0(1). Indeed, after T{3tc) generations we have at least max(3tc,a) 
active vertices, and in each of the following generations until well beyond 
K{tc + a), the number is w.h.p. multiplied by at least 1.3, say, by the proof 
of Lemma 8.2 or by Lemmas 9.1, 9.4 and 7.3. Similarly, r(l/p) — T{e/p) < 1 
w.h.p., arguing as in Case 3 of the proof of Lemma 8.2. 

Consequently it suffices to consider T{e/p) — T{K{tc + a)). We define iter- 
ates Tp as in Section 10.1 by Tf^^ := Fs{Tp), j > 0, but now starting with 
Tp ■.= K{t^ + a). Further, let 

(10.10) Ns := mm{j > : T^^ > e/p}. 

By Lemma 10.6 we may assume that F_s{i) ^ A{t) < Fs{t) for all t G [K{tc + 
a), e/p], and then, by induction as in Lemma 10.2, Tj < Tj^^^^^f^^.^)) < 

Tf_l-^ for ah j > with Tj„i+r(E-(ic+a)) < ^/P- Consequently, w.h.p. 

(10.11) N_s > T{e/p) - T{K{tc + a)) >Ns-l. 
To find Ns, rewrite (10.7) as 

Fs{t) ( t 



cstc \cstc, 

where cs := {r/{l + 6)y/^^'~^^ . Iterating we see that, for j > 0, 
and thus 



cstc \cstcj V c^i, 



and 



Cstc J V Cstc 



J log r = log log — — — log log 



Consequently, 
(10.12) Ns = 



cstc 1 V 



log log I ^ ) - log log ( ^^^^J""' ) ) / logr 



c^ic / V cstc 
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In order to simplify this, note that, using (3.1), 

(10.13) log(^) =l°s(^) +0(l) = ^log(np) + 0(l) 
and thus 

(10.14) log log ( — ) =loglog(np) + 0(1). 

ycstcj 

Further, we may assume that a > ac/2 > tc/4, since otherwise the process 
is subcritical and A* < 3tc w.h.p. by Theorem 3.1. Hence, log{K(tc + a)) = 
log a + 0(1) and thus, since also log(c5tc) = logOc + 0(1), 

(10.15) logf =loga-logac + 0(l) = log — + 0(1). 

V cstc J ac 

We may assume that K > ecs, so log{K(tc + a) /{cstc)) > 1, and then (10.15) 
yields 

(10.16) log log f:^^^^) =log+log- + 0(l). 

V cstc ) ac 

Finally, (10.12), (10.14) and (10.16) yield 

iV^logr = loglog(np) — log , log — + 0(1). 

Note that the right-hand side depends on 6 only in the error term 0(1). 
Hence, we have the same result for N^s, and the result follows by (10.11) 
and the comments at the beginning of the proof. □ 

10.3. The final stage. We finally consider the evolution after 1/p vertices 
have become active. We let, as in Section 8, b* := bcU}{n) where Lo{n) — >• oo 
slowly; we assume that 6* <C 1/p [which is possible since pbc — )■ by (3.5)]. 
By Remark 8.3, r(n — b*) < t{1/p) + 3 w.h.p., so it suffices to consider the 
evolution when less than b* vertices remain. 

Let J-f ■= c{/j(s) : 1 < i < n, 1 < s < t} be the c-field describing the evo- 
lution up to time t. 

Lemma 10.8. For any t and u with <t < t + u < n, the conditional dis- 
tribution of A{t + u) — A{t) = S{t + u) — S{t) given Tt is Bin(n — A(t), 7r(t; u)), 
where 

(10.17) "<' + ">-'"> . 

1 - 7r(t) 

If further n — b* <t <t + u<n, then, uniformly in all such t and u, 

(10.18) ■K{t]u)=pu{l + o{l)). 
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Proof. Conditioned on J^t, A{t) is a given number, and of the n — a 
summands in (2.8), n — a — S{t) =n — A{t) are zero. For any of these terms, 
the probabiUty that it changes from at time t to 1 at time t + u is, by (2.11), 

P(y.<t + n|y.>t) = '^(^^'^-^^ + ^^ = "(^ + "^-"(^) =vr(fn) 

Hence, the conditional distribution oi S{t + u) — S{t) is Bin(n — yl(t), 7r(t; u)). 

To see the approximation (10.18), note first that for n — b* <t < n, since 
we assume pb* — )• 0, we have 6*<Cl/p<Cra so t ^ n. Hence, using again 
pb* — >• and recalling the notation h'^ from (3.7), 

^(t + 1) - ^(t) =nYi=t+i) = (l\p'\i- p)*+i-^ 

(10.19) 

[r — Ij! n 
Furthermore [cf. (3.7)], still for n — b* <t < n, 

1 - 7r(t) = F{Bm{t,p) < r - 1) ~ P(Bin(t,p) = r - 1) 

(10.20) 

n^"^ .1, ... b' 



p^-^(l-p)" = ^. 



(r — 1)! n 
Consequently, 7r(t + u) — 7r(t) = (1 + o{l))upb'^/n and 

vrft + n) — 7r(t) , , ^^upbi/n , , 

-(t; n) = ^ i_t(t) = (1 + '^^^^^j;r = + □ 

Lemma 10.9. Suppose that r > 2, ^ p ^ n"^/'' and a = o(n). // 
6c — ?• oo and n — b* < t < n, then A{t) = n — bc{l + Op(l)); in particular, 
n - A{t) < 26c 

Proof. We have, using (2.12) and (10.20), since 6c — ^ oo implies 6^ ~ 6c, 

6' 

E(n - Ait)) =n-a - E5'(t) = (n - a)(l - 7r(t)) ~ (n - a)— ~ 6c 

n 

and similarly, using (2.13), 

Var(n - A{t)) = Var S{t) <{n- a)(l - 7r(t)) ~ 6c. 
Thus, by Chebyshev's inequality, since 6c — oo, 

n - A(t) = (1 + o(l))6c + Op(6c'/') = (1 + Op(l))6c. □ 

Proposition 10.10. Suppose that r > 2, <C p ^ n"^/*" and a = 
o{n). Then, when A* > 3tc j 

r-r(l/p) = (l + o(l))i^ + Op(l). 

np 
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In particular, if further p > clog(n)/n for some n>0, then r — r(l/p) = 
Op(l)- 

Furthermore, when A* = n, w.h.p. r — r(l/p) < 3. 

Proof. By Remark 8.3, after r(l/p) + 3 generations, the active size is 
T^(i/p)+3 >'^-^* w.h.p. 

If 6c — ^ 0, we can choose h* = 1/2, so w.h.p. T^(i/p)_|_3 = n and r < t{1/p) + 3. 
More generally, if 6c = 0(1)) we have by (10.19), 

mSin) - Sin - 6*)) < uMn) - 7r(n - 6*)) ~ n6*^ 

n 

= p6*6c = 0(p6*) = o(l). 

Hence, w.h.p. S{n) = S{n — 6*), which means that no further activations 
occur after n — b* . Consequently, in this case too, w.h.p. r = r(n — 6*) < 
t(1/p) +3. In particular, this proves that r < r(l/p) + 3 w.h.p. when A* = n, 
since w.h.p. yl* < n if 6c — )• oo by Theorem 3.2. 

Further, when 6c = 0(1), (3.10) implies that np > logn for large n, so 
logn/(np) < 1, and the result holds in this case. 

Now assume that 6c — )• oo. For convenience, we modify the counting of 
generations and start at t = n — b*, regarding the active but unused vertices 
at n — b* as "generation 0." (We may assume that 6* is an integer.) Thus 
define, recursively, 

T^:=n-6*, 
T;^,:=A{T;), j>0, 

^j^=t;^i-t; = a(t;)-t;, 

t' :=max{j > 0: Aj > 0}. 

Since w.h.p. T^(i/p)_i < max(l/p, a) < n — b* < r^(i/p)_|_3, it follows by in- 
duction that T^(i/p)„i_|.j < Tj < T^(i/p)_|_3_|_j, j > 0, and thus w.h.p. 

(10.21) r + r(l/p) - 1 < T < r + t{1/p) + 3. 

Consequently, it suffices to estimate r'. 

By Lemma 10.8, conditioned on [i.e., on T' and the evolution up 

to Tj, which in particular specifies A[T'-)], for large n, 

E(A,+i I J-^r;) = {n- ^(Tj))^(Tj; A,) < (n - ^(T^))2pA,- 
and thus, by induction, since Aq <n — TQ = b*, 

(10.22) E(A,- \J^T!,)< (2(n - ^(r^))p)^Ao < (2(n - A(T^))p)^6*. 

Further, Lemma 10.9 yields n — A{Tq) = n — A[n — 6*) < 26c w.h.p. Conse- 
quently, (10.22) implies, w.h.p. for all j > (simultaneously), 

(10.23) E(Aj |J^r/)<(4^j6c)^6*. 
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Recall that pbc — )• by (3.5), so we may assume 4p6c < 1- If J is chosen such 
that {Aph^yb* — ^ 0, then (10.23) implies that w.h.p. Aj = and thus r' <j. 
Hence, for any oo' = uj'{n) — )• oo, w.h.p. 

|log(p6c) + log4| 
which is another way of saying [33], Lemma 3, 

(10-24) r' < ,^ ,1 + Op(l) = ,/°f^* , (1 + o(l)) + Op(l). 

|log(p&c) + log4| |log(p&c)| 

For a lower bound, fix e with < e < 1, and define the deterministic 
numbers A J by 

(10.25) AJ :={l-ey+\pb,yb*. 
Let uj" := l/{pbc) — s- oo. We claim that w.h.p. 

(10.26) Aj > AJ for all j>0 such that Aj > uj" . 

By our assumption Apbc < 1, we have Aj^j^/Aj < 1/4, so Aj — )• geomet- 
rically fast. 

By Lemma 10.9 and b^/b* — )• 0, w.h.p. 

Ao = A(r^) - (n - b*) = A(r^) - n + 6* > 5* - 26c > (1 - e)b* = A^ , 

so (10.26) holds w.h.p. for j = 0. 

Say that j > is good if Aj > Aj and fat if ^(Tj) > n - (1 - e/4)6c- Let 
j > 0. At time Tj we have A{Tj) — Tj = Aj active but unused vertices. Fur- 
ther, by Lemma 10.8 we have, conditioned on J-'rp> (which specifies both T' 
and Aj), 

A,+i = T;+2 - T'j+i = A{T; + A,) - A{T;) G Bin(n - A{T;),n{T;; Aj)). 

By Lemma 10.8, 7r(rj; Aj) =pAj{l + o(l)) > pAj(l - e/4) for n large, so 
if j is good but not fat, 

E(A,+i I J^T,) = {n- A{T'j))TT{T'j; Aj) > (1 - e/Afb,pAj 

>(l-e/2)6cpA->(l + e/2)A-^_i 

and Chebyshev's inequality yields, since x i— )■ x/(x — a)^ is decreasing for 
x > a, 



Var(Aj+i I Jt') 

P(A,+i<AT^J J-^,)< 



^+1' (E(A,h.i|J-^,)-At^^)2 
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E(Aj+i I Tr, 



< 



3 



j+i) 



\2 



{l+£/2)A- 
< '—^±1 = 



Say that j is bad if j is not good and that j fails if j is fat or bad. Then, 
by stopping at the first j that fails we see that 

P(some j < oj" fails) < P(some j < oj" is fat) + P(0 is bad) 

+ P(j is bad I j — 1 is good and not fat) 



i>0: A->a;" 



< ¥{A{n) > n - (1 - e/4)6c) + o(l) 

j:AJ>uj" ^ 

since A{n) < n — (1 — e/4)6c w.h.p. by Lemma 10.9 and the final sum is 
0{l/uj") = 0(1) because the terms 1/A~ increase geometrically, so the sum 
is dominated by its largest (and last) term. 

We have shown that w.h.p., if A~ > a;", then Aj > Aj > and thus 
T'>j. Hence, by (10.26) and (10.25), w.h.p. 

log((l - e)b*/J') 



(10.27) r' > 



|log((l-e)p6c)| 



(1 + 0(1)) + 0(1). 
|log(p6c)| 



Combining the upper bound (10.24) and the lower bound (10.27), we find 

By (3.3), log(p6c) = —{np — r\og{np) + 0(1)) and 

logn > logft* > log 6c ^ log?! — pn — 0(1). 
Hence, finally (10.28) yields 

np — r log(np) + 0(1) 
The result now follows from (10.21). □ 

11. Proofs of Theorems 5.2, 5.6, 5.8. We prove in this section Theo- 
rems 5.2, 5.6 and 5.8 related to the boundary cases. We consider first the 
case p ~ c/n. 
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Proof of Lemma 5.1. By the implicit function theorem, at least lo- 
cally, the root xq{9) is smooth except at points where 

(11.1) f{x,c,e) = ^fix,c,e) = o. 

We begin by studying such critical points. 

Let g{y) := P(Po(y) < r — 1) = 1 — ^p{y)■, cf. (3.13). Differentiations yield 

r— 1 

(11.2) g'{y) = -F{Fo{y)=r-l)^ ^ 



(r-1) 



(11.3) g"{y) =1^-1 ]g'{y) = !-J-^g'{y). 



We have [see (5.2)] f{x,c,9) = 1 — x — {1 — 9)g{cx) and thus -§^f{x,c,6) = 
— 1 — c(l — 0)g'{cx). Hence, (11-1) holds if and only if 

{l-e)g{cx) = l-x, 
c(l - %'(cx) = -1, 

which imply g{cx) = — c(l — x)g'{cx) and thus 

g'{cx) 

Let h{y) :=y- g{y)/g'{y), y > 0, so (11.4) says c = h{cx). Then, by (11.3), 

h'( ) = ^ ^'^'^^ 9{y)g"{y) ^ r-l-y g{y) 
9'{y) 9'{y? y ' g'iy)' 

Since g{y) > and g'{y) < for y > 0, h has a global minimum at y = r — 1, 
and the minimum value is 

w X gir-l) P(Po(r- 1) <r-2) 

mmh(y) = h(r - 1) = r - 1 - {- =r + — ) '-^ f = c^. 

y>0 ^ ^ g'{r-l) P(Po(r - 1) = r - 1) 

Furthermore, h(y) > y — )• oo as y — )■ oo, and h{y) — )■ oo as y — )■ too, because 
then g{y) — > 1 and g'{y) — )■ 0. 

Consequently, if < c < Cc, then (11.4) has no solution x > 0, and thus 
there is no critical point. If c = Cc, there is exactly one x > satisfying (11.4) 
[viz., x = {r — l)/cc], and if c> Cc, there are two. Since (11.4) implies c > cx, 
these roots are in (0, 1). 

To complete the proof, it is perhaps simplest to rewrite (5.4) as ^ = '(?(x), 
with 

(11.5) 'd{x):=l- {1- x)/g{cx). 

Since g{y) > for y > 0, i9 is a smooth function on [0, 1], with i9(0) = and 
i9(l) = 1. Moreover, f{x,c,0) = g{cx)[9 — 'd{x)), which implies that 

d 

f{x,c,9) = —f{x,c,e) = ^ = i9(x) and t?'(x) = 0. 
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Consequently, by the results above, if c < Cc, then ^ so ^'{x) > for 
rr > 0. In this case, ?? is strictly increasing and thus a bijection [0, 1] — )• [0, 1], 
and xq is its inverse. 

If c = Cc, then = only at a single point, and it follows again that i!) is 
a strictly increasing function and xq is its inverse. 

If c> Cc, then 'd'{x) = at two values xi and X2 with < xi < X2 < 1 and 
cxi < r — 1 < CX2- It can be seen, for example, using (11.3), that < 
< '&"{x2), and thus i9 is decreasing on the interval [xi,2;2]. The result 
follows, with 6c = 6~ = max(t?(x2), 0) and xo(^c) = xi. [Note that 

^(2:2) = niiii2;g[o,i] i^ix) < if c is large enough.] □ 

Remark 11.1. If c > Cc, then thus xq{9c) = xi is the smallest root of 
'^'{x) = 0, or equivalently xi = yi/c where yi is the smallest root of h{y) = c; 
further, 6^ = "^ixi) while xq{9c+) is the other root of 'd{x) = 'd{xi). 

If c = Cc, we have yi = r — 1 and thus xi = {r — l)/cc and, using (11.5) 
and (5.5), 

V-l\ l-(r-l)/cc 



9{r-l) 
1 



o(r - 1) = r - 1) + P(Po(r - 1) < r - 2) ' 

For c > Cc, the two roots xi(c) and X2(c) of 'd'{x) = are smooth functions 
of c, and thus 

d^'c d „ , , , , d-d , , , , d-d , ,,,,,, dd , , 

_1 = ^/(MC)) = g^iMc)) + ^(X1(C))XUC) = g^iMc)) < 0, 

where the last inequality follows from (11.5), and similarly d9~ /dc < 0. 
Hence, 9c{c) and 6~{c) are decreasing functions of c, as claimed in Re- 
mark 5.3. 

Lemma 11.2. Suppose that r > 2, p = 0(l/n) and tp = o(l). Then 
5(t)=Op(t). 

Proof. We may assume 1 < ^ < 1/p. [Note that S{t) = for t < r.] Then 
7r(t) = 0{t'^p^) = o{tp) by (8.1), and thus the expected number of activated 
vertices is E5(f) = (n — o)7r(i) = o{npt) = o{t). □ 

Proof of Theorem 5.2. First, in (i) and (ii), ap^9c = 0. Let e > 0. 
Taking t = (1 + e)a in Lemma 11.2, we find w.h.p. ^((l + e)o) < ea and thus 

A((l + e)o) = o + 5((1 + e)a) < (1 + e)a, 

whence A* = T < {\ + e)a. Consequently, 1 < A* /a < 1 + e w.h.p., prov- 
ing (i) and (ii). 
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Next, by (2.9), Lemma 7.1 and (2.12), uniformly for all t > 0, 

A{t) = a + S{t) = a + E5'(t) + Op(n) = (n - a)7r(t) + a + Op(n) 
and thus, using also (3.15), 

n"^A{t) = (1 - e)Ti{t) + e + op(i) = (1 - e)^t) + e + op(i). 

Substituting t = xn, we find by (3.13), since tp = xc + o{x), uniformly in all 
x>0, 

n~'^A{xn) = (1 - e)F{Po{tp) >r)+e + Op(l) 
= (1 - e)F{Fo{cx) >r)+e + Op(l) 
and, recalling (5.1), still uniformly in x > 0, 

(11.6) n"^(A(xn) - xn) = f{x, c, 6) + Op(l). 

Let e > 0. Since f{x,c,9) > for x € [0,a;o(^)), and thus by compactness 
f(-,c,9) is bounded from below on [0,xq{6) — e], (11.6) implies that w.h.p. 
A{xn) — xn> on [0,xq{9) — e], and thus T > {xq{9) — e)n. Furthermore, 
both in (iii) and in (iv) with 6 ^ 6^, we have ■^f{xQ{9),c,9) ^ and thus 
if e > is small enough, f{xo{9) + e,c,9) < 0, so (11.6) implies that w.h.p. 
A{{xo{9) + e)n) < {xo{9) + e)n and thus T < {xo{9) + e)n. □ 

The proof of Theorem 5.5 is very similar to the one of Theorem 3.6. We 
first give a more precise estimate of the process S{t), which is the analog of 
Lemma 9.3 in the case p = c/n. However, in this case, we get a Brownian 
bridge because here we consider a large part of the distribution of Yi. 

Lemma 11.3. Suppose r>2,p = c/n and 9n with c> and <9 <1. 
Then 

(11.7) z(.):=?iSi^3§^^Wmc^)) 

in D[0,1], where Wq is a Brownian bridge and ip{y) :=¥{Fo{y) > r) as 
in (3.13). 

Proof. Let S{u) := Y17=i HUi < n}, < -u < 1, where Ui € [7(0, 1) are 

i.i.d. By (2.8) and (2.11), we have S{t) = 5(7r(t)), jointly for ah t > 0. Fur- 
ther, ;^5'(ti), u G [0, 1], is the empirical distribution function ofUi,. . . , Un-a, 
and thus by [16], Theorem 16.4, in D[Q, 1], 

yn — a 

Furthermore, by (3.15) and (3.13), 

7r(xn) = Tr{xn) + 0(l/n) = ilj{xnp) + 0(l/n) = tp{cx) + 0(l/n), 
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uniformly for x > 0, and it follows, using the continuity of Wq, that 

S (xn) — E, S (xn) d S(TT(xn)) — K S(-K(xn)) d , ^ 

^= = ^= > Woiipicx)) 

y/n — a yjn — a 

in D[0, 1], which proves the result since n — a~(l — ^)n. □ 

Proof of Theorem 5.5. It suffices to consider a such that ar^Q^n. 
By (2.12), (3.15) and (3.13), 

ES'(xn) = (n — a)'jr{xn) = {n — a)Tr{xn) + 0(1) 

(11.8) 

= (re-a)V'(ca;) + 0(1). 

By the Skorohod coupling theorem ([35], Theorem 4.30), we may assume 
that the processes for different n are coupled such that the limit (11-7) in 
Lemma 11.3 holds a.s., and not just in distribution. Since convergence in 
D[0, 1] to a continuous function is equivalent to uniform convergence, this 
means that a.s. Z{x) — )• Wq{iP{cx)) uniformly for x £ [0, 1]. Hence, we have, 
using (11.8) and (5.1), 

A(xn) — xn 

= a + S{xn) — xn 



xn 



(11.9) = a + (n - a)i){cx) + {I - e^)nZ{x) -xn + 0(1) 



(a - e^n){l - i^{cx)) + n/(x, c, ^c) + ^/ {I - e^)nZ{x) + 0(1) 



= n/(x,c, 0c) + (« - e^n){l - ij{cx)) + - 9^)nWo{ij{cx)) 

+ Op(n^/'), 

uniformly for x G [0, 1] . 

We first use (11.9) to derive the simple estimate 

(11.10) A{xn) — xn = nf{x, c, 6c) + Op(n), 

uniformly for x G [0, 1]. By Lemma 5.1, f{x,c,9c) = for x = xq or x = xi, 
with f{x, c, 9c) > for X G [0, xq) U {xo,xi) and /(x, c, ^c) < for x G (xi, 1]. 
Hence, for any fixed small e > 0, (11.10) implies that w.h.p. A{xn) — xn > 
for X G [0, xo — e] U [xo + e, xi — e] and A{xn) — xn < for x G [xi + e, 1] , and 
hence T G [xq — e, xq + e] U [xi — e, xi + e] . It follows by a standard argument 
that there exists a sequence e„ \ such that w.h.p. 

A* = T £ [xo -e„,xo + e„] U [xi - e„,xi + e„]. 

Moreover, w.h.p. Tg [xq - £«, a^o + en] if and only if inf [^^.g^^a-g+j^] (^(xn) - 
xn) < 0. (We may also assume that e„ is so small that < xq and 2e„ < 

Xl - XQ.) 
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For X G [xq — Sn,xo + En] , we have by (11.9) again, and the continuity of tp 
and Wo, 

A{xn) — xn = nf{x, c, ^c) + (a — 6cn){l — V'(cxo) + o(l)) 

(11.11) , 

+ V(l-^c)nWo(V'(cxo)) + Op(n^/2). 

Further, f{xQ,c,6c) = and f{x,c,9c) > for x G [xq — en,xo + and 
thus (11.11) yields 

inf {A{xn) — xn) = {a — 6cn){l — ^p{cxo) + o(l)) 

n I 

(11.12) , 

+ ^(1 - 9c)nWo{iP{cxo)) + Op(ni/2). 

The cases (i) and (ii) are easily derived. We thus focus on (iii). We then 
have, from (11.12), 

n~^^^ inf {A{xn) — xn) 

xe[xo-£n,Xo+£n] 



= y(l - ^P{cxo)) + ^/l-9,Wo{^P{cxo)) + Op(l) 

and thus, since (1 — '^p{cxo))~^^/l — 9cWo{^l'{cxo)) G N{0,a'^), where cr^ 
{l-9c)i^{cxo)/{l-i^{cxo))>0, 



inf {A[xn) — xn) < 

x£[xo-£n,XO+£n] 



= F{y{l - V^(cxo)) + V^-OcWoi^{cxo)) < 0) + Op(l) 
= l-$(y/c7) + 0p(l). 
The result follows. □ 

To prove Theorem 5.6 (p~ cn~^^^), we first show using the previous re- 
sults that if we can activate co{n) — )• oo vertices, then the activation spreads 
w.h.p. to the entire graph. It remains to show that starting with a finite 
number of active vertices, the process activates uj{n) vertices with a prob- 
ability bounded away from and 1. This will be done using a branching 
process argument. 

Lemma 11.4. Suppose that p > cn~^^''' for some c > 0. // uj{n) — ?• oo, 
then w.h.p. A{t) > t for all t with uj{n) <t<n—l. 

Proof. This is easy to prove directly, but we prefer to view it as a corol- 
lary of our estimates for smaller p. Thus, let p' := uj{n)~^/'^^n~^/^' . We may 
assume oj{n) < n and then n~^ <^p' <^ n~^/^ , so p' < p, at least for large n, 
and we may assume that G„y Q Gn,p- We may consider bootstrap percola- 
tion on Gn,p' and Gn,p simultaneously, with the same initial set Aq of size a; 
we use the description in Section 2, starting with families of i.i.d. random in- 
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dicators /j'(s) G Be(p') and Ii{s) € Be(p) where we may assume /j'(s) < Ii{s). 
Then, using ' to denote variables for Gn,p', S'{t) < S(t) and A'{t) < A{t). 
We apply Lemma 8.2 to G^y. The critical time for G„y is [see (3.1)] 

t'^ = 0((n(y)'-)-i/("-i)) = w(n)i/2(-i) = o{u{n)). 

Further, p' > ■nT^I'^''' > n~^^^ so, by (3.3), b'^ — >• 0, and we may choose b*' 
with b*' 0. Hence, Lemma 8.2 shows that w.h.p. A{t) > A'{t) > t for t £ 
[3t'^,n — b*'] , and the result follows since, for large n, 3t'^ < u){n) and n — b*'> 
n-1. □ 

Proof of Theorem 5.6. For (ii), we apply Lemma 11.4 (if necessary 
with a smaller c). Taking u(n) = a, we see that w.h.p. A{t) > t for all t G 
[a,n — 1]. Since also A{t) > a, we have A(t) > t for all t <n — 1, and thus 
A* = T = n. 

For (i) suppose r > 2, p~ cn~^^^ and let a>r be some constant. The 
probability that a vertex is activated at a given time k is by (2.7) 

(11.13) F{Y, =k)= }) /(I -pf-"- ~ : 1) ^- 

For any fixed the random variables 

Xk:=A{k)-A{k-l) = S{k)-S{k-l)= ^ l{Yi = k}, 

k = 1, . . . ,K , form together with 

XK+i:=n-a-A{K)= ^ l{Yi>K} 

a random vector with the multinomial distribution Mul(n — a, {pk)k=i) with 
Pk = ¥{Yi = k), k<K, andpK+i=nYi>K + l). By (11.13), {n-a)pk^ 
[^~\)c'' for k < K, and it follows that X^ for k < K have a joint Poisson 
limit, 

(11.14) (Xfc)f=i A (6)f=i witha.GPo(^(^J:2j)c^) independent. 
Using the notation of Remark 5.7 we thus obtain 

k 

A{k) — > a + S^j = a + k + Sk for A; = 1, . . . , t jointly 

i=i 

and thus P(r = k) ^ F{f = k) ior k < K and P(r > K) ^ P(f > K). 

Since K is arbitrary, we have shown ¥{A* = k) = P(T = fc) — )• P(T = k) = 
C(o, c, /c) for every finite k>l. Furthermore, P(r > K) - F{f > K)-^0 for 
any fixed K, and a standard argument shows that there exists a sequence 
Krr^-^ oo such that P(r > Kn) - P(r > Kn) 0, and thus P(r > i^„) ^ 
P(T = oo). On the other hand, Lemma 11.4 with uj{n) = Kn shows that 
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P{Kn < T < n) ^ 0. Consequently, F{T = n) = F{T > + o(l) F{f = 
oo) = C(a,c). 

It is clear that C,{a,c,k) = P(r = /c) > for every k > a. To see that 
also C(a,c) = P(T = oo) > 0, note that, see (11.14), E^fc = (^Zi)c^ ^ oo as 
A; — 7- oo. Hence, there is some Kq such that E^^o > Since stochasti- 
cally dominates (^Kq for ^ ^ -^O) it follows that if the process reaches Kq 
without stopping, the continuation dominates (up to a change of time) 
a Galton- Watson branching process with offspring distribution S^Kq, which 
is supercritical and thus has a positive probability of living forever. Hence, 
P(f = oo)>0. □ 

Proof of Theorem 5.8. It suffices to consider a = r. Thus assume a = 
r, and consider the vertices activated in the first generation, that is, at time 
t = r. There are S{r) S Bin(n — r,p^) such vertices. [Note that, see (2.11), 
7r(r) = P(Bin(r,p) = r) = p'^ .] Consequently, ES'(r) = (n — r)p''' — ?• oo. Let 
uj{n) =ES'(r)/2, so oj{n) — )• oo. It follows from Chebyshev's inequality (or 
Chernoff's) that w.h.p. S{r) > uj{n). Hence, w.h.p. for all t G [r, a;(n)], A{t) > 
A{r) > S{r) > uj{n) > t. Together with the trivial A{t) > a = r > t for t <r 
and Lemma 11.4, this shows that w.h.p. A(t) > t for all t < n — 1, and thus 
A* = T = n. □ 
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